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1.  Motivation  and  Objectives 

Reconfiguration  is  likely  to  be  a  feature  of  future  generations  of  flight  control  systems.  The  main 
motivation  for  reconfiguration  is  greater  survivability,  attained  through  the  ability  of  the  feedback 
system  to  reorganize  itself  in  the  presence  of  actuator  failures  and  surface  damage.  High- 
performance  aircrafts  are  often  unstable  to  the  point  of  exceeding  the  control  capabilities  of  human 
pilots,  either  because  of  requirements  of  maneuverability,  or  to  optimize  efficiency.  Instability  will 
therefore  become  an  increasingly  common  characteristic  of  fighter  aircrafts  and  the  need  for  fault 
tolerant  control  methods  will  become  increasingly  critical.  In  addition,  the  benefits  of 
reconfigurable  flight  control  systems  extend  beyond  the  immediate  considerations  of  safety.  Since 
reconfigurable  systems  reduce  the  need  for  other  forms  of  reliability,  such  as  redundant  actuators, 
increased  maintainability  and  reduced  costs  are  expected  to  result  from  this  technology. 

In  the  past,  the  Air  Force  has  supported  a  major  R&D  program  through  the  Self-Repairing  Flight 
Control  System  (SRFCS)  program,  administered  through  Wright-Patterson  Air  Force  Base.  A  suc¬ 
cessful  flight  test  marked  the  end  of  the  first  phase  of  the  program.  The  approach  followed  in  this 
program  was  based  on  the  concept  of  failure  detection  and  identification;  a  fast  and  efficient 
method  to  categorize  the  failure  among  a  set  of  pre-planned  conditions  was  combined  with  pro¬ 
cedures  to  handle  each  of  the  conditions.  In  general,  this  approach  worked  well,  but  it  presented 
significant  problems.  For  a  large  number  of  possible  failures,  it  is  difficult  and  time-consuming  to 
carry  out  the  detection  and  classification.  In  the  case  of  flight  control,  there  are  multiple  possible 
actuator  failures  (multiple  actuators  and  multiple  failure  modes,  such  as  locked  or  floating)  and  an 
infinite  variety  of  possible  surface  damages.  In  addition,  a  problem  is  that  failure  detection  relies 
on  models  of  the  unfailed  system.  Therefore,  any  discrepancy  between  the  model  and  reality  can 
lead  to  false  detection.  The  nonlinearity  and  the  complexity  of  aircraft  dynamics  (especially  engine 
dynamics  and  aerodynamics)  make  robust  failure  detection  and  classification  a  particularly  difficult 
problem. 

This  report  describes  research  results  obtained  for  the  design  of  reconfigurable  flight  control  sys¬ 
tems  using  adaptive  methods.  In  this  approach,  the  dynamic  behavior  of  the  aircraft  is  identified  in 
real-time  and  a  controller  is  designed  automatically.  Because  failure  classification  is  not  relied 
upon,  the  resulting  system  is  expected  to  tolerate  a  larger  class  of  failures,  including  some  that  may 
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not  have  been  anticipated  originally.  Fundamental  research  problems  that  were  addressed  in  the 
course  of  the  research  include  the  design  of  multivariable  model  reference  adaptive  control  algo¬ 
rithms,  and  the  improvement  of  their  transient  and  robustness  properties.  Simulation  results  were 
obtained  using  a  detailed  nonlinear  aircraft  model.  They  demonstrate  the  ability  of  the  adaptive 
algorithms  to  quickly  readjust  trim,  to  restore  tracking  of  the  pilot  commands  despite  the  loss  of 
actuator  effectiveness,  and  to  coordinate  the  use  of  the  remaining  active  control  surfaces  to  main¬ 
tain  the  decoupling  between  the  rotational  axes.  The  report  also  discusses  methods  for  estimation 
of  uncertainty  in  dynamic  models  and  a  modified  least-squares  algorithm  with  forgetting  factor. 
This  algorithm  yields  fast  adaptation  under  normal  conditions,  but  reduced  sensitivity  to  noise 
when  signal  to  noise  ratios  are  poor. 

2.  Overview  of  Research  Results 

2.1  References 

The  research  results  are  available  in  references  [1]  to  [10].  These  references  acknowledge  AFOSR 
support  and  are  provided  in  Appendix.  Further  publications  are  expected  to  be  derived  from  [1], 
[2],  and  [3].  We  present  a  brief  overview  of  these  results. 

2.2  Multivariable  Algorithms  for  Reconfigurable  Flight  Control 

Several  problems  concerning  the  design  of  multivariable  adaptive  control  algorithms  were  investi¬ 
gated,  with  testing  of  these  algorithms  in  simulations.  Reconfigurable  flight  control  requires  mul¬ 
tivariable  strategies  because  strong  cross-couplings  usually  appear  after  actuator  failures.  A  Ph.D. 
thesis  [10],  initiated  before  the  grant,  was  completed.  In  it,  various  aspects  of  multivariable  adap¬ 
tive  control  system  design  were  addressed  and  the  results  appeared  in  conference  and  journal  arti¬ 
cles  [4],  [7],  and  [9].  In  [4],  the  problem  of  parameter  convergence  and  sufficient  excitation  was 
addressed.  Necessary  and  sufficient  conditions  were  given  on  the  spectrum  of  reference  inputs  so 
that  parameter  convergence  could  be  guaranteed.  In  the  process,  it  was  found  that  existing  algo¬ 
rithms  had  to  be  modified  to  ensure  convergence.  Without  such  modifications,  parameter  conver¬ 
gence  was  not  only  impossible  to  guarantee,  but,  further,  examples  were  found  where  instabilities 
occurred  in  the  presence  of  unmodelled  dynamics  that  could  be  tolerated  with  modified  algorithms. 

In  [7],  [9],  the  problem  of  model  reference  adaptive  control  for  multivariable  systems  with  unk¬ 
nown  high-frequency  gain  matrices  was  considered.  In  flight  control,  the  high-frequency  gain 
matrix  is  the  matrix  giving  the  forces  and  moments  provided  by  the  actuators.  This  matrix  is  likely 
to  change  drastically  after  failures.  Prior  to  our  research,  model  reference  adaptive  control  algo¬ 
rithms  required  this  matrix  to  be  positive  definite,  an  assumption  that  would  be  difficult  to  justify 
in  the  context  of  reconfigurable  flight  control.  A  solution  was  developed  that  only  required  a 
bound  on  the  norm  of  the  high-frequency  gain  matrix  to  be  known.  A  sort  of  hysteresis 
modification  was  used  and  the  stability  of  the  overall  adaptive  system  was  established  analytically. 
Examples  showed  that  the  modification  proposed  in  the  paper  was  necessary  to  guarantee  the  sta¬ 
bility  of  the  adaptive  system. 

Special  forms  of  multivariable  model  reference  adaptive  control  algorithms  were  derived  that  were 
suitable  for  reconfigurable  control  [2].  Specifically,  algorithms  were  developed  that  were  applicable 
when  full  state  information  was  available  and  when  constant  disturbances  were  present  (the  con¬ 
stant  disturbances  represented  trim  changes  after  failures).  The  control  algorithm  that  was  obtained 
had  a  nominal  structure  involving  a  constant  feedforward  gain  matrix,  a  state  feedback  gain  matrix, 
and  a  set  of  constant  biases  or  trim  values.  This  stmcture  is  highly  similar  to  existing  flight  control 
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system  laws.  In  this  case  however,  the  structure  was  augmented  with  a  procedure  to  determine  the 
parameters  adaptively. 

The  multivariable  adaptive  control  algorithms  were  evaluated  in  simulations  [2],  [3].  A 
detailed  aircraft  model  was  implemented,  which  was  the  model  used  for  the  AIAA  Design  Chal¬ 
lenge.  This  model  of  a  twin-engine  fighter  aircraft  is  a  full  nonlinear  simulation  that  includes 
models  of  actuator  dynamics  (including  limits  and  rate  saturation),  engine  dynamics,  aerodynamic 
forces,  and  atmospheric  properties.  The  model  also  allows  to  study  single  actuator  failures  and  the 
resulting  cross-couplings  between  the  longitudinal  and  lateral  motions.  Trim  biases  occurring  after 
failures  and  dynamic  variations  with  flight  conditions  are  represented  through  the  nonlinear  model. 

Using  simulated  data,  the  parameters  of  the  linearized  model  of  the  aircraft  were  identified.  A 
batch  least-squares  algorithm  was  applied  to  the  responses  of  the  original  aircraft  and  of  the  aircraft 
with  a  locked  left  horizontal  tail  surface.  The  parameters  of  a  model  reference  control  law  were 
calculated  from  the  estimated  aircraft  parameters  and  were  also  independently  estimated  using  a 
direct  input  error  algorithm.  Good  agreement  was  found  between  the  direct  and  indirect  procedures, 
with  the  direct  algorithm  requiring  somewhat  less  computations. 

Very  good  tracking  of  pitch  rate,  roll  rate  and  yaw  rate  commands  was  observed  when  the 
model  reference  control  law  was  used.  The  responses  of  the  control  law  designed  for  the  unfailed 
aircraft  were  found  to  deteriorate  significantly  after  the  failure,  but  were  restored  if  new  controller 
parameters  were  used.  This  is  illustrated  in  Figs.  1  and  2.  In  Fig.  1,  the  solid  lines  are  the  desired 
responses,  specified  by  the  reference  model.  The  dashed  lines  are  the  aircraft  responses.  On  the 
first  row  are  the  responses  in  pitch  rate  and  roll  rate  to  pitch  rate  commands.  On  the  second  row 
are  the  responses  in  pitch  rate  and  roll  rate  to  roll  rate  commands.  In  this  simulation,  the  control 
law  was  the  one  designed  for  the  original  aircraft,  although  the  actual  aircraft  had  a  locked  left  hor¬ 
izontal  tail.  The  pitch  rate  response  is  seen  to  be  significantly  less  than  the  desired  response.  The 
roll  rate  response  is  also  smaller  than  specified,  because  roll  control  is  partly  achieved  through  the 
elevators  in  this  aircraft.  There  is  a  significant  cross-coupling  from  the  pitch  rate  command  to  roll 
rate.  This  is  due  to  the  loss  of  symmetry  in  the  elevator  command  and  to  the  production  of  a  large 
rolling  moment. 

The  responses  with  the  automatically  redesigned  controller  are  shown  on  Fig.  2.  The  dashed  lines 
are  the  responses  obtained  with  a  fixed  controller,  and  the  dot-dashed  lines  with  an  adaptive  con¬ 
troller.  The  fixed  controller  was  obtained  by  taking  data  for  the  failed  aircraft  over  a  short  period 
of  time  prior  to  the  experiment.  In  the  adaptive  controller,  the  control  law  was  determined  and 
applied  in  real-time.  The  responses  show  that  tracking  of  the  commands  was  restored  to  the 
desired  values.  Cross-couplings  which  appeared  after  the  failure  (as  roll  responses  to  pitch  rate 
commands)  were  reduced  by  a  factor  of  2  with  the  fixed  controller,  and  to  negligible  values  when 
an  adaptive  control  law  was  used.  Implicit  in  these  responses  is  the  correct  identification  of  the 
trim  values  needed  to  maintain  level  flight.  This  property  is  important  since  not  only  the  dynamic 
parameters  of  the  aircraft  may  change  after  a  failure,  but  the  actuator  positions  required  to  avoid 
large  transient  responses  may  also  be  significantly  different. 

2.3  Robust  Adaptive  Algorithms 

Research  progressed  towards  improving  the  understanding  of  the  robustness  properties  of  adaptive 
control  algorithms  as  they  apply  to  reconfigurable  flight  control.  In  particular,  the  issues  of  tran¬ 
sient  performance  and  robustness  in  adaptive  systems  were  studied.  The  paper  [8]  described  a 
phenomenon  similar  to  the  burst  phenomenon  in  adaptive  systems  without  noise  or  unmodelled 
dynamics:  despite  theoretical  results  guaranteeing  stability  and  convergence  of  tracking  errors  to 


Figure  2:  Aircraft  Responses  —  Redesigned  Control  Law 

zero,  the  transient  response  of  certain  adaptive  algorithms  was  found  to  exhibit  large  peaks  even  in 
ideal  conditions.  The  problem  was  analyzed  and  was  attributed  to  slow  parameter  convergence. 
The  study  demonstrated  the  critical  importance  of  excitation  in  adaptive  systems,  and  the  need  for 
fast  and  efficient  adaptive  algorithms  (such  as  least-squares  algorithms)  in  adaptive  stabilization 
problems. 

An  instability  mechanism  was  studied  in  [5],  which  was  observed  in  an  adaptive  system 
excited  by  several  sinusoids.  In  this  case,  there  was  no  question  of  insufficient  excitation,  but  the 
adaptive  system  was  based  on  a  reduced-order  model.  The  instability  arose  even  though  the 
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adaptive  system  was  stable  when  only  one  sinusoidal  component  was  present.  In  the  paper,  the 
phenomenon  was  analyzed  and  explained  using  averaging  methods.  It  was  also  shown  that  an  input 
error  algorithm  remained  stable  and  exhibited  superior  performance.  The  paper  showed  the  impor¬ 
tance  of  the  proper  choice  of  adaptive  algorithm  for  robustness  considerations.  In  general,  it  was 
found  that  algorithms  with  indirect  adaptation  structure,  such  as  indirect  adaptive  control  algorithms 
and  so-called  input  error  model  reference  adaptive  control  systems,  were  more  robust  in  the  pres¬ 
ence  of  unmodelled  dynamics. 

Several  methods  were  studied  for  the  purpose  of  obtaining  reliable  estimates  of  uncertainty  in  the 
parameters  identified  through  a  least-squares  algorithm.  Estimates  based  on  a  stochastic  analysis, 
an  analysis  assuming  bounded  noise,  and  a  sensitivity  analysis  were  compared  in  [6].  The  error 
measures  were  tested  using  experimental  data  obtained  on  a  DC  motor.  It  was  found  that  the  sto¬ 
chastic  estimates  were  unrealistically  optimistic,  because  of  the  presence  of  correlated  noise. 
Sensitivity-based  estimates  were  found  to  more  accurately  reflect  the  uncertainty  in  the  parameter 
estimates.  The  bounded  noise  analysis  yielded  similar  results  but  required  a  priori  estimates  of  the 
noise  bounds.  A  linearized  longitudinal  model  of  an  F-16,  developed  at  Wright  Patterson  AFB, 
was  also  used  in  the  study.  The  interest  of  this  model  was  that  it  represented  a  modem,  high- 
performance  fighter  aircraft  with  unstable  dynamics.  Fairly  detailed  models  of  sensor  noise  were 
also  included.  It  was  again  found  that  the  estimates  of  uncertainty  provided  by  the  sensitivity  esti¬ 
mates  were  more  reliable  than  the  stochastic  estimates. 

2.4  Modified  Least-Squares  with  Forgetting  Factor 

Thanks  for  Mr.  Phil  Chandler  of  Wright  Patterson  AFB,  the  PI  became  aware  of  the  report  [11]  for 
the  Phase  I  SBIR  project:  "Self-Designing  Flight  Control  Using  Modified  Sequential  Least-Squares 
Parameter  Estimation  and  Optimal  Receding  Horizon  Control  Laws,'  supported  by  the  AFOSR.  In 
the  report,  a  modified  least-squares  algorithm  was  presented  and  its  advantages  for  flight  control 
reconfiguration  demonstrated.  Although  such  an  opportunity  was  not  anticipated  in  the  original 
proposal,  the  PI  investigated  the  concept  of  the  modified  least-squares  and  obtained  a  recursive 
algorithm  based  on  the  same  idea.  An  averaging  analysis  was  carried  out  that  showed  that  the  algo¬ 
rithm  exhibited  some  very  interesting  features,  including: 

•  a  variable  data  forgetting  capability,  so  that  the  memory  of  the  algorithm  was  long  when  the 

excitation  was  poor,  and  short  when  the  excitation  was  significant; 

•  a  second-order  filtering  capability,  as  opposed  to  the  first-order  filtering  of  the  original  algorithm, 

reducing  the  rate  of  fluctuations  of  the  parameters; 

•  a  stabilizing  effect  on  the  covariance  matrix  update,  similar  to  some  previously  discussed  in  the 

literature. 

The  results  were  reported  in  [1].  Fig.  3  from  the  paper  shows  the  main  two  features  mentioned 
above.  On  the  left  is  the  response  of  the  modified  algorithm.  On  the  right  is  the  response  of  the 
standard  least-squares  algorithm  with  forgetting  factor.  The  tme  parameter  varies  between  1  and  -1 
and  appears  as  a  triple  step  function.  The  smooth  curves  are  the  responses  predicted  by  the  averag¬ 
ing  analysis,  while  the  jagged  curves  are  the  responses  of  the  parameter  estimate.  The  approximate 
responses  provided  by  the  averaging  analysis  are  found  to  be  close  to  the  original  responses.  At 
time  400,  there  is  a  drop  in  excitation  (and  therefore  of  the  signal-to-noise  ratio).  The  response  of 
the  modified  algorithm  is  seen  to  slow  down  after  that,  while  the  parameter  fluctuations  remain 
limited.  For  the  standard  least-squares  with  (the  same)  forgetting  factor,  it  is  found  that  when  the 
excitation  drops,  the  fluctuations  in  the  parameters  become  very  large.  These  fluctuations  could  be 
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Figure  3:  Adaptive  Parameter  Responses 
Left;  Modified  LS  Right:  Original  LS 

reduced  by  using  a  different  forgetting  factor,  but  the  response  in  the  first  part  of  the  simulation 
would  then  be  slower.  With  the  modified  algorithm,  a  better  tradeoff  is  obtained  so  that  the  algo 
rithm  reacts  fast  when  there  is  sufficient  excitation,  yet  keeps  parameter  fluctuations  reasonable 
when  the  information  available  cannot  sustain  a  fast  adaptation. 

3.  Interactions  with  Air  Force  Labs 

During  the  period  of  the  grant,  the  PI  interacted  extensively  with  researchers  at  Wright  Patterson 
Air  Force  Base  involved  in  the  reconfigurable  flight  control  work,  in  particular  Mr.  Phil  Chandler 
and  Mr.  Mark  Meats  of  Wright  Patterson  AFB,  and  Dr.  Meir  Pachter  of  the  Air  Force  Institute  of 
Technology.  The  PI  visited  Wright  Patterson  AFB  on  the  following  occasions: 

•  An  initial  visit  at  the  start  of  the  grant  (May  29,  1992).  Shortly  after  that,  Dr.  Siva  Banda  visited 

the  PI  at  Carnegie  Mellon  University  (October  26,  1992).  Dr.  Banda  gave  a  presentation  on 
multivariable  control  methods  for  flight  control,  and  visited  the  university  facilities. 

•  The  2nd  quarterly  review  meeting  for  the  contract:  “Application  of  Multivariable  Control 

Theory  to  Aircraft  Control  Laws,”  (September  23,  1993).  The  PI  later  submitted  a  list  of 
comments  on  the  "zeroth  draft"  of  the  report  [12]  and  used  some  of  the  concepts  of  the  report 
in  this  project. 

•  The  kick-off  meeting  for  the  SBIR  Phase  II  effort  “Self-Designing  Controller”  conducted  by 

Barron  Associates  Inc.,  with  the  participation  of  Lockheed  Fort  Worth  and  Calspan  Co 
(October  17,  1994). 

Technical  discussions  with  researchers  of  Wright  Patterson  AFB  were  also  held  while  at  confer¬ 
ences,  including: 

•  the  Automatic  Control  Conference  (June  24-26,  1992) 

•  the  IEEE  Conference  on  Control  Applications  in  Dayton,  OH  (September  14-16,  1992) 

•  the  IEEE  Conference  on  Decision  and  Control  in  Tucson,  AZ  (December  16-18,  1992) 

•  the  Eglin  Guidance  and  Control  Workshop  organized  by  Dr.  Jim  Cloutier  of  Eglin  Air  Force 

Base,  in  Fort  Walton  Beach,  FL  (May  12-14,  1993) 
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•  the  Air  Force  Contractors  Meeting  in  Ann  Arbor,  MI  (May  24-25,  1993) 

•  the  American  Control  Conference  in  San  Francisco,  CA  (June  2-4,  1993) 

•  the  IEEE  Conference  on  Decision  and  Control,  in  San  Antonio,  TX  (December  15-17,  1993) 

•  the  Eglin  Guidance  and  Control  Workshop  organized  by  Dr.  Jim  Cloutier  of  Eghn  Air  Force 

Base,  in  Fort  Walton  Beach,  FL  (April  26-28,  1994) 

•  the  AFOSR  Workshop  on  Dynamics  and  Control,  in  Wright  Patterson  Air  Force  Base,  OH  (June 

13-15,  1994). 

•  the  IEEE  Conference  on  Decision  and  Control,  in  Orlando,  FL  (December  14-16,  1994). 

The  PI  organized  an  invited  session  titled;  "Identification  with  Modelling  Uncertainty  and 
Reconfigurable  Control"  that  was  presented  at  the  IEEE  Conference  on  Decision  and  Control,  m 
San  Antonio,  TX  (December  15-17,  1993).  The  session  was  organized  to  bring  together  reputed 
researchers  in  the  area  of  system  identification  and  aerospace  applications.  A  special  emphasis  of 
the  session  was  on  identification  techniques  that  account  for  modelling  uncertainty,  and  on  the  use 
of  these  methods  for  automatic  control  reconfiguration.  The  participants  included:  Prof.  L.  Ljung 
(Linkoping),  Dr.  V.  Klein  (NASA),  Prof.  G.  Goodwin  (Newcastle),  Mr.  P.  Chandler  (WPAFB), 
Dr.  D.  Bayard  (JPL),  and  Prof.  E.  Wu  (SUNY). 

Together  with  Mr.  Phil  Chandler  and  Dr.  M.  Pachter,  the  PI  also  submitted  an  invited  session  titled 
“Nonlinear  and  Adaptive  Methods  for  Reconfigurable  Flight  Control.”  The  session  was  accepted 
for  presentation  at  the  American  Control  Conference  to  be  held  in  Seattle,  WA,  on  June  21-23, 
1995.  It  was  organized  to  give  visibility  to  the  work  on  reconfigurable  control  and  to  bring  together 
researchers  working  in  various  organizations.  In  addition  to  the  organizers,  the  participants  of  the 
invited  session  include:  R.  Barron  (Barron  Assoc.),  Professor  H.  Michalska  (Me  Gill  University), 
H.  Youssef  (Lockheed  Advanced  Development  Corporation),  and  A.  Caglayan  (Charles  River  Ana¬ 
lytics). 

4.  Transitions 

The  PI  has  been  in  contact  with  Dr.  D.  Ward,  of  Barron  Associates  Inc.,  with  an  interest  in  imple¬ 
menting  the  recursive  form  of  the  modified  least-squares  algorithm  as  part  of  their  SBIR  Phase  II 
effort.  As  mentioned  earlier,  the  PI  developed  the  recursive  form  of  the  algorithm,  based  on  the 
concepts  delineated  in  BAFs  Phase  I  report.  Since  the  kick-off  meeting  for  the  Phase  II  effort,  the 
PI  has  had  several  discussions  with  D.  Ward  regarding  the  implementation  of  the  algorithm  and 
sent  him  Matlab  files  coding  the  algorithm  in  three  forms,  an  exact  form  and  two  approximate  but 
less  computationally  demanding  forms. 

The  PI  has  also  been  interacting  with  Professor  W.  Messner,  of  Carnegie  Mellon  University, 
on  a  project  for  the  control  of  tape  drives  systems.  This  project  is  carried  out  in  cooperation  with 
Datatape  (a  division  of  Kodak)  and  represents  an  interesting  dual-use  of  the  technology  for  mul¬ 
tivariable  adaptive  systems.  The  need  for  adaptation  arises  from  several  sources  including  varia¬ 
tions  of  tape  characteristics  with  temperature,  humidity,  and  tape  manufacturer,  and  uncertainty  in 
the  tape’s  position. 

5.  Personnel 

M.  de  Mathelin,  graduate  student,  completed  his  Ph.D.  thesis  [10]  under  the  grant  in  January  1993. 
J.  Groszkiewicz,  graduate  student,  carried  out  and  completed  his  M.S.  thesis  [3]  under  the  grant  in 
December  1994. 


-  8  - 


K.-C.  Choi,  undergraduate  student,  worked  under  the  grant  during  the  Summer  1993. 
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An  Adaptive  Algorithm  with  Information- Dependent 

Data  Forgetting 
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Department  of  Electrical  Engineering,  University  of  Utah 
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Abstract:  An  adaptive  algorithm  is  derived,  based  on  a 
weighted  least-squares  criterion  incorporating  a  penalty  on 
parameter  variations.  The  result  b  a  recursive  algorithm 
whose  covariance  matrix  update  b  identical  to  existing  reg¬ 
ularized  or  stabilbed  least-squares  algorithms.  In  partic¬ 
ular,  the  covariance  matrix  and  its  inverse  are  bounded 
under  weak  conditions.  The  parameter  update  equation 
is  different  from  conventional  algorithms  and  exhibits  a 
second-order  response,  as  opposed  to  the  first-order  re¬ 
sponse  of  the  standard  weighted  least-squares  algorithm. 
An  averaging  analysis  shows  that  a  linear  time-invariant 
approximation  of  the  parameter  response  gives  useful  in¬ 
formation  about  the  system.  The  approximate  system  has 
two  sets  of  poles,  with  one  set  being  the  same  as  the  one 
obtained  for  the  weighted  least-squares  algorithm.  The 
other  set  of  poles  is  close  to  the  origin  in  the  z-plane  when 
t  here  is  significant  excitation,  and  close  to  the  number  one 
when  the  excitation  vanishes.  The  results  allow  to  select 
design  parameters  so  that  the  algorithm  reacts  with  a  fast 
response  under  normal  conditions,  but  slows  down  when 
the  e.xcitation  is  insufficient  to  yield  adequate  noise  rejec¬ 
tion.  Simulations  demonstrate  on  a  simple  example  the 
properties  obtained  analytically. 


1  Introduction 

There  b  a  well-known  trade-off  in  adaptive  control  between 
the  rate  of  convergence  of  the  adaptive  parameters  and  the 
sensitivity  of  their  responses  to  noise.  Rapid  adaptation  b 
particularly  important  in  applications  such  as  flight  control 
reconfiguration  for  unstable  aircrafts.  However,  the  speed 
of  response  of  an  adaptive  algorithm  b  limited  in  practice 
by  the  amount  of  noise  and  unmodelled  dynamics  that 
must  be  tolerated. 

Least-squares  algorithms  are  generally  preferred  over 
gradient  algorithms  because  of  their  superior  convergence 
properties.  Exponential  data  weighting,  i.e.,  the  forget¬ 
ting  factor  approach,  b  the  most  common  modification  to 
the  least-squares  algorithm.  Forgetting  is  incorporated  to 
ensure  that  the  algorithm  b  able  to  track  parameter  varia¬ 
tions.  Unfortunately,  a  problem  is  that  the  response  of  the 
algorithm  is  unbounded  if  the  regressor  vector  is  not  per- 
sbtently  exciting.  This  problem  has  been  previously  ad¬ 
dressed  through  covariance  resetting  [6],  stabilized  covari¬ 
ance  updates  [4]  and  variable  forgetting  factor  approaches 

[2],  [31- 

In  [8],  a  batch  form  of  the  least-squares  algorithm  with 
forgetting  factor  was  used  in  a  study  of  flight  control  re¬ 
configuration.  The  authors  faced  an  unacceptable  trade¬ 
off:  either  the  response  of  the  algorithm  was  too  slow,  or 
the  fluctuations  due  to  the  nobe  were  too  large.  To  solve 
the  problem,  they  proposed  to  modify  the  algorithm  so 
that  a  penalty  on  the  parameter  variations  was  included 

"Research  supported  by  the  Air  Force  Office  of  Scientific  Re¬ 
search  under  grant  F49620-92-J-0386.  The  U.S.  government  has 
certain  rights  in  this  material. 


in  the  lesist-squares  criterion.  This  allowed  for  the  use  of 
a  smaller  forgetting  factor,  yielding  shorter  memory  and 
therefore  faster  response,  yet  with  acceptable  parameter 
fluctuations  in  the  presence  of  nobe.  Simulations  demon¬ 
strated  thb  property,  although  no  analysis  was  given. 

In  thb  paper,  we  use  the  same  idea  as  proposed  by 
[8],  but  instead  derive  a  recursive  algorithm.  Because  of 
differences  in  problem  formulation,  it  b  not  clear  that  the 
recursive  algorithm  is  equivalent  to  the  batch  algorithm 
studied  in  [8],  although  it  is  based  on  the  same  principle 
and  exhibits  similar  properties.  In  addition  to  real-time 
implementation,  an  advantage  of  the  recursive  form  is  that 
it  allows  for  comparison  with  the  standard  least-squares 
algorithm  with  forgetting  factor  and  for  a  more  detailed 
analysis. 

The  covariance  matrix  update  of  the  new  algorithm 
is  the  same  as  the  Levenberg-Marquardt  regularization 
method  [5],  also  a  special  case  of  the  stabilized  least- 
squares  algorithms  of  [4].  On  the  other  hand,  the  adaptive 
parameter  update  is  unconventional.  The  first-order  differ¬ 
ence  equation  b  replaced  by  a  second-order  equation.  In¬ 
sight  into  the  dynamic  properties  of  the  parameter  update 
is  obtained  through  the  application  of  averaging  methods 

I,  ir. 

It  b  found,  using  the  averaging  approximation,  that  tlie 
covariance  matrix  equation  is  exponentially  stable,  with  an 
equilibrium  matrix  that  is  bounded  and  positive  definite. 
The  parameter  update  equation  b  approximated  asymp¬ 
totically  by  a  second-order  system  with  two  sets  of  poles. 
One  set  is  the  same  as  obtained  with  a  comparable  algo¬ 
rithm  with  forgetting  factor  but  without  the  penalty  on 
the  parameter  variations.  The  second  set  of  poles  varies, 
depending  on  the  amount  of  excitation.  When  there  is 
sufficient  excitation,  the  second  set  of  poles  b  close  to  the 
origin  in  the  z-plane.  Therefore,  the  convergence  rate  b 
determined  by  the  first  set  of  poles,  i.e.,  by  the  forget¬ 
ting  factor.  When  the  excitation  b  reduced,  the  second 
set  of  poles  moves  towards  the  number  one.  Therefore,  for 
sufficiently  small  values  of  the  excitation,  the  convergence 
slows  down  and  the  effective  memory  b  increased.  Simula¬ 
tions  are  given  to  illustrate  the  results,  showing  that  con¬ 
vergence  b  fast  when  there  is  significant  excitation,  while 
fluctuations  due  to  noise  remain  acceptable  when  the  ex- 
rif.af.ion  is  DOOr. 


2  Adaptive  Algorithm 

We  consider  the  standard  problem  where,  at  dbcrete  in¬ 
stants  k,  with  k  =  1...TI,  measurements  of  a  scalar  signal 
y{k),  and  of  a  so-called  regressor  vector  w{k)  are  obtained. 
In  iderd  conditions,  these  signals  are  assumed  to  satisfy  the 
linear  relationship 


j/(fc)  =  (1) 

where  6"  is  a  vector  of  unknown  parameters,  to  be  deter¬ 
mined  by  the  adaptive  algorithm.  In  practice,  equation  (1) 
is  affected  by  noise  and  other  unmodelled  effects.  Further, 


1 


it  is  possible  that  the  vector  6'  may  vary  slowly,  or  change 
abruptly. 

At  time  n,  we  consider  the  following  least-squares  cri¬ 
terion 

n 

^[«(«)]  =  -  tn^(fc)0(rr))^A"-^ 

+  a\e{n)-e{n-lf  (2) 

For  a  =  0,  this  is  the  standard  least-squares  criterion  with 
exponential  data  weighting,  also  called  forgetting  factor. 
The  parameter  A  is  the  forgetting  factor,  typically  chosen 
between  0.9  and  0.99.  For  a  /  0,  a  weighting  term  is  added 
that  penalizes  large  variations  between  the  new  parameter 
estimate  6  (n)  and  the  previous  estimate  d(n  —  1). 

Setting  dE/d6{n)  =  0  yields 

B[n)  =  I  ^  'w(k)w^ {k)X"  ^  +  oti 

\k=i 

71 

^m(A-)!/(i:)A”“''  +oe(n  -  1)  I  (3) 

This  is  the  ’’batch”  form  of  the  adaptive  algorithm,  al¬ 
though  it  is  not  a  true  batch  estimate,  because  of  the  de¬ 
pendency  of  B{n)  on  6{n  —  1).  It  is  natural  to  define 

P(;i,)  =  -f  (4) 

For  consistency  with  standard  terminology,  we  will  call  this 
matrix  the  covariance  matrix  of  the  adaptive  algorithm.  A 
recursive  form  for  the  inverse  of  the  covariance  matrix  is 
deduced  from  (4)  to  be 

P"'(n)  =  XP~^{n  —  1)  +  w{n)w^{n)  -t-  0(1  —  A)/  (5) 

Given  (4),  the  initial  condition  for  the  recursive  equation 
is  F“'(0)  =  al.  Now,  consider  that 

n 

P~\n)0{n)  =  ^m(it)r/(fc)A"-*  -f  cte{n  -  1)  (6) 

k=l 

to  deduce  that 

P~^{n)6{n)  =  w{n)y(n)  (7) 

-t-A  (P“'(n  -  1)6>(ti  -  1)  -  ae{n  -  2))  +  ae{n  -  1) 

After  a  few  steps,  one  finds  that  a  recursive  formula  for 
the  parameter  6{n)  is 


P{n]  and  6(n)  are  expressed  in  terms  of  P{n  —  1)  and 
—  1)  are  perhaps  more  familiar  than  the  ones  shown 
here,  but  they  are  equivrJent. 

2)  For  Cf  0,  the  covariance  matrix  update  is  the  same  as 
the  Levenberg-Marquardt  regularization  method  [5],  pp. 
364-365.  Kreisselmeier  [4]  also  studied  various  stabilized 
least-squares  algorithms,  of  which  this  update  is  a  spe¬ 
cial  case.  The  adaptive  algorithm  derived  here  shares  the 
advantages  and  the  disadvantages  of  the  stabilized  algo¬ 
rithms.  The  main  advantage  is  that  both  the  matrix  P 
and  its  inverse  P~^  are  bounded  (see  section  3).  In  other 

words,  the  algorithm  does  not  sufer  from  the  drawbacks 
of  the  least-squares  algorithm  and  of  the  least-squares  al¬ 
gorithm  with  forgetting  factor,  namely  that  P  goes  to  zero 
in  the  first  case  when  there  is  excitation  (thereby  halting 
adaptation)  and  goes  to  infinity  in  the  second  case  when 
there  is  no  excitation.  On  the  other  hand,  a  drawback  is 
that  the  use  of  the  matrix  inversion  lemma  does  not  yield 
a  simple  matrix  covariance  update  because  of  the  matrix 
o(l  —  A)/  that  is  found  in  the  right-hand  side.  We  defer 
discussion  of  this  issue  until  section  5. 

3)  The  update  law  for  6{n)  is  significantly  different  from 
the  update  law  in  the  original  least-squares  algorithm  with 
forgetting  factor.  We  observe  the  addition  of  a  term  pro¬ 
portional  to  6(n  — 1)  — 0(n  — 2).  This  transforms  the  update 
law  from  a  first-order  difference  equation  into  a  second- 
order  equation.  Considerably  more  insight  will  be  gained 
into  the  dynamic  behavior  of  this  equation  using  an  aver¬ 
aging  analysis  in  section  4. 

4)  Note  that  the  parameter  a  appears  in  three  places,  that 
is,  in  both  update  laws  and  as  initial  condition  for  the 
covariance  matri.x  update.  The  initial  condition  for  the 
parameter  vector  is  arbitrary. 

3  General  Properties  of  the  Covariance 
Matrix  Update  Law 

The  update  law  for  the  covariance  matrix  (5)  possesses  de¬ 
sirable  properties,  under  very  general  conditions.  Since  the 
algorithm  is  the  same  as  the  first  stabilized  least-squares 
algorithm  proposed  in  [4]  (case  N  =  1),  the  properties 
found  there  can  be  directly  applied  here.  For  convenience 
of  notation,  we  define  R(n)  =  P~^{n).  Also,  we  allow  for 
arbitrary  initial  conditions  R{0)  =  R^ (0)  (normally  R{0) 
would  be  equal  to  al). 

Fact:  (a)  R{n)  =  R^ {n)  Vn 

(b)  R{n)  =  al  is  an  exponentirJly  stable  equilibrium  of 
the  unforced  equation  (5)  (that  is,  with  w{n)  =  0) 

(c)  R{0)  >  al  =>  R{n)  >  aJ  Vn 

(d)  R{0)  >  al  and  «;(n)  bounded  =>  R{n)  bounded 
Comment:  see  [4]  for  a  proof.  The  proof  can  also  be 
reconstructed  easily,  given  the  linearity  of  equation  (5). 
The  fact  formally  specifies  the  properties  previously  men¬ 
tioned,  among  others  that,  under  weak  conditions,  both 
the  covariance  matrix  and  its  inverse  are  bounded. 


0(n)  =  6{n  —  1)  -h  P{n)w{n)  (y(n)  —  w^(n)0(n  —  1)) 

-f  aAP(n)  (0(n  —  1)  —  ^(ri  —  2))  (8) 

Comments:  1)  Equations  (5)  and  (8)  form  the  adap¬ 
tive  algorithm.  It  can  be  checked  that,  for  o  =  0,  these 
equations  are  the  same  as  the  usual  update  laws  for  the 
least-squares  rJgorithm  with  forgetting  factor.  An  update 
law  for  the  matrix  P(n)  can  be  obtained  from  the  up¬ 
date  for  P~^(n]  by  the  use  of  the  matrix  inversion  lemma 
(see  section  5).  This  update  must  be  performed  first,  then 
the  matrix  P(n)  must  be  used  to  calculate  B(n).  Alterna¬ 
tively,  an  update  law  for  B(n)  can  be  derived  that  requires 
P(n  -  1)  instead  of  P(n).  The  update  equations  where 


4  Averaging  Analysis 

The  update  law  for  the  adaptive  parameter  is  different 
from  conventional  update  laws,  and  is  more  difficult  to 
analyze.  To  gain  insight  into  the  dynamic  properties  of 
the  overall  adaptive  system,  we  carry  out  an  averaging 
analysis.  Averaging  has  been  applied  succesfuUy  to  adap¬ 
tive  sj'stems,  including  in  the  case  where  the  dynamics  are 
nonUnear  (see  [7]  in  the  continuous-time  case  and  [l]  in 
the  discrete-time  case,  for  example).  The  assumptions  re¬ 
quired  for  the  approximations  to  be  valid  are,  in  general, 
that  the  exogeneous  signals  (that  is,  y(n),  it>(n))  are  sta¬ 
tionary,  and  that  the  states  of  the  adaptive  system  vary 
slowly  compared  to  the  exogeneous  signals.  Accordingly, 


we  require  that  u;(Tr)  be  stationary,  meaning  that  the  limit 

ko  +  t^ 

R^{n)  =  lim  17  w{k)w'^ik  +  n)  (9) 

^  '  N-^oo  fy  ' 

k~ko 

exists  for  all  n  and  uniformly  with  respect  to  to-  Of  spe¬ 
cific  interest  is  the  matrix  iZ„,(0),  which  is  symmetric  and 
positive  semi-definite.  It  is  positive  defimte  if  and  only  it 
the  regressor  vector  w  b  persbtently  exciting.  It  is  also 
the  average  of  the  matrix  tn(n)u;^(n).  ^ 

For  convenience,  we  abo  assume  that  y(n)  =  w  (n)8 
(no  noise),  and  define  the  parameter  error  vector 

^(n)=0(n)-e*  (10) 

Then,  the  update  laws  can  be  written  as 

p-i(„)  =  XP~^(n  -  1)  +  tv(n)w^(n)  -f  a(l  -  A)/  (11) 


^(n)  =  ip(n  -  1)  -  P{n)w{n)vj'^ {n)ip{n  -  1) 

4-  aXP{n)  (y>(n  -  1)  -  7>(n  -  2))  (12) 

The  system  is  nonlinear  and  time- varying,  with  the 
time  variation  originating  from  the  matrix  of  signals 
tu(n)u>^(n).  To  justify  the  averaging  approximation  (slow 
variation),  we  define  u(ti)  and  cr  through 

w{n)  =  \/e  v{n)  X  =  I  —  ea  (13) 

Then,  (11)  can  be  written  as 

p-'(,i,)  =  P~^(7i-l)-i-e  (u(n)u^(n)  -  crP^^in  -  1)  + 

(14) 

For  £  sufficiently  small,  the  response  of  this  system  can  be 
approximated  [1]  by  that  of  the  averaged  system 

-  1)  +  =  {MO)  -  -  1)  +  aa/) 

(15) 

In  terms  of  the  original  variables, 

(n  -  1)  +  ^-(0)  +  "(1  -  M  (16) 

In  summary,  we  found  that,  assuming  that  the  regressor 
vector  M;(n)  is  stationary  and  sufficiently  small,  and  as- 
suming  that  the  forgetting  factor  A  is  sufficiently  close  to 
one,  the  trajectories  of  the  system  (5)  are  arbitrarily  close 
to  those  of  the  averaged  system  (16). 

The  equation  for  p-J  is  a  linear  time-invariant  system, 
which  b  exponentially  stable.  All  the  poles  for  this  system 
are  located  at  z  =  A.  The  equilibrium  matrix  of  the  system 

(P*)-*  =  -^-(^^(0) -f-a(l  -  A)/)  (17) 

Consistently  with  the  results  of  section  3,  the  matrix 
is  bounded  if  in  b  bounded,  and  its  inverse  P 
is  also  bounded  with  P'  < 

For  the  parameter  response,  averaging  theory  does  not 
seem  to  be  directly  applicable.  However,  note  that  (11) 
and  (12)  lead  to  the  expression 

¥7(n)  =  P(n)  {XP-\n  -  l)<p(n  -  1) 

-ha¥i(n  -  1)  -  a7’(w  -  2))  (18) 

Thb  equation  defines  ^{n)  as  a  function  of  P(’0 
that  is,  without  any  direct  input  from  io(n).  In  this  case, 


we  will  define  the  averaged  system  as  the  system  obtained 
by  replacing  P(n)  by  Pc„(n)  in  (18).  Using  (16), 

ip<>v(ll)  =  —  1)  —  Pat.(n)Pti;(0)<Pat,(ll  —  1) 

-fQ'APa»(n)  (v’ao(n  —  1)  —  ipavin  —  2))  (19) 

Thb  expression  leads  to  useful  insight  into  the  dynamics  of 
the  adaptive  system.  In  section  6,  we  will  abo  demonstrate 
through  examples  the  closeness  of  thb  approximation. 

Assuming  that  the  matrix  Pav  has  converged  to  its 
steady-state  value,  and  noting  that  P  Pto(O)  =  (1  — A)/  — 
a(l  -  A)P',  the  dynamics  of  the  parameter  update  law  are 

given  by 

9av{n)  =  {XI  +  aP*)ipav{n  -  l)  -  a\P\av{n  -  2)  (20) 

This  is  a  linear  time-invariant  system.  Its  poles  are  given 
by  the  roots  of 

det((z-A/)(z-ciP*))=0  (21) 

If  the  dimension  of  the  parameter  and  regressor  vectors 
is  m,  there  will  be  2m  poles,  with  m  of  them  located  at 
z  =  A  (the  forgetting  factor).  The  remaining  m  poles  are 
located  at  the  eigenvalues  of  aP*.  However, 

aP*  =  a(l  -  A)  (P„,(0)  +  a(l  -  A)/)”'  (22) 

If  there  is  negligible  excitation  in  all  directions  of  the  pa¬ 
rameter  space,  the  matrix  Pu;(0)  will  be  small,  leading  to 
a  matrix  aP*  close  to  the  identity  and  eigenvalues  close  to 
1.  For  large  excitation,  the  matrix  Pu,(0)  will  dominate, 
leading  to  a  matrix  aP'  close  to  zero  and  eigenvalues  close 
to  0.  In  other  words,  the  convergence  will  be  slow  and 
the  memory  long  when  there  is  little  e.xcitation.  But  fast 
responses  and  short  memory  will  be  achieved  if  the  excita¬ 
tion  is  sufficiently  large.  Above  a  certain  excitation  level, 
however,  the  convergence  rate  does  not  increase  and  is  de¬ 
termined  by  the  other  poles  at  z  =  A  and  the  forgetting 

factor  A.  j  .i.  i 

The  threshold  of  excitation  can  be  adjusted  through 
proper  choice  of  the  desip  parameter  a.  In  the  scalar 
case,  for  example,  the  additional  pole  b  located 
power  in  the  signal  w  b  such  that  P„,(0)  =  j(l  —  A)  .  For 
higher  e.xcitation  leveb,  the  additional  pole  will  be  faster 
than  the  pole  at  A.  For  lower  excitation  levels,  its  response 
win  be  slower  and  will  dominate.  In  the  multivariable  case, 
a  similar  result  holds.  Further,  the  varia.ble  forgetting  p- 
pability  is  directional.  For  a  two-dimensional  system  with 
significant  excitation  in  one  direction  but  poor  excitation 
in  the  other,  one  pole  will  be  close  to  zero  and  the  other 
close  to  one  (in  addition  to  the  two  polp  at  A).  Extended 
memory  will  be  obtained  in  the  direction  of  poor  excita¬ 
tion. 

5  Implementation  Issues 

One  of  the  difficulties  with  the  implementation  of  the  al¬ 
gorithm  b  that  the  update  law  is  expressed  in  terms  of 
p-^(n).  In  the  original  algorithm  (with  a  =  0),  one  may 
use  the  matrix  inversion  lemma 

A"'  =  P-'  -f  C  (23) 

^  A  =  B  -  BC{1  +  C^BCr'C^B  (24) 

to  obtain  a  recursive  formula  for  P(n).  Because  the  matrix 
C  =  m  is  a  vector,  the  matrix  1 4-  C'^ BC  is  a  scalar  a-C'd  is 
easily  inverted.  In  the  case  of  the  modified  algorithm  (5)- 
(8),  the  matrix  inversion  lemma  could  stUl  be  used,  but 
the  matrix  O  would  have  to  be  defined  by 

C  =  (  m  ya(l  -  A)  I  )  (25) 
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Figure  1;  Parameter  Response  -  Modified  LS 

so  that  the  inversion  of  a  matrix  witli  dimension  liigher 
than  P{n)  would  be  required.  One  might  as  vvell  up¬ 
date  P~'{n)  and  invert  it  to  update  8{n)  in  tiiat  case. 
This  problem  is  well-known  and  shared  by  the  stabilized 

[4]  or  regularized  [5]  algorithms.  A  solution  proposed  by 

[5]  consists  in  replacing  a(l  —  A)/  in  the  update  law  by 
>110(1  —  A)e,c,^,  where  c,  is  a  vector  which  is  zero  every¬ 
where  except  at  the  ith  position  where  it  is  one.  in  is  the 
dimension  of  w.  As  time  progresses,  the  index  i  is  incre¬ 
mented  and  returned  to  one  when  the  end  of  the  vector  is 
reached.  With  this  modification,  the  matrix  C  becomes 

C  =  (  lu  ^ya{l  —  A)c,  )  (26) 

Through  the  use  of  the  matrix  inversion  lemma,  only  a  two- 
dimensional  matrix  needs  to  be  inverted  at  each  time  step. 
Surprisingly,  this  modification  induces  no  change  to  the 
averaged  system  definition,  since  the  matrix  ma'(l  — A)eieJ^ 
is  stationary  and  has  average  a(l  —  A)/,  :.e,  the  same  as 
the  original  matrix. 

6  Example  and  Simulation  Results 

We  consider  the  scalar  example 

y{k)  =  w(k)e' +  n{k)  (27) 

where  n[k)  is  an  additive  noise  consisting  of  independent, 
identically  distributed  gaussian  random  variables  with  zero 
mean  and  standard  deviation  0.1.  In  the  first  part  of  the 
simulations 

w{k)  =  sin(27rfc/50)  (28) 

that  is,  w[k)  is  a  sinusoidal  signal  with  magnitude  1  and 
period  equal  to  50  time  steps.  In  the  algorithm,  the  for¬ 
getting  factor  is  set  to  A  =  0.9.  This  corresponds  to 
a  memory  of  10  steps  (if  we  use  a  definition  equivalent 
to  the  continuous-time  definition  of  time  constant,  i.e., 
A^°  ~  e~*)'  The  parameter  a  is  set  to  2.45,  for  rea¬ 
sons  to  be  explained  later.  In  Figure  1,  the  response 
of  the  adaptive  parameter  is  shown.  The  true  param¬ 
eter  6’  was  equad  to  1  from  0  to  200,  —1  from  200  to 
500,  and  1  from  500  to  1000.  It  is  shown  on  the  fig¬ 
ure  as  a  discontinuous  signal.  Two  additional  responses 
are  shown:  the  adaptive  parameter  response  and  the  aver¬ 
aged  adaptive  parameter  response,  which  is  the  smoother 
curve  of  the  two.  The  proximity  of  these  two  responses 


Figure  2:  Covariance  .Matrix  Response  -  Modified  LS 
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Figure  3:  Parameter  Response  -  Original  LS 

can  be  observed  at  once,  demonstrating  the  value  of  the 
averaging  approximation.  At  time  400,  the  magnitude  of 
the  signal  w(k)  was  reduced  from  1  to  0.1.  As  a  conse¬ 
quence.  Rw{0)  was  reduced  from  0.5  to  0.005.  The  value 
of  O'  is  such  that,  for  /?u/(0)  =  0.005,  the  second  pole  at 
a(l  —  A)/(q(1  —  A)-f  /iu,(0))  is  equal  to  0.98,  or  a  time  con¬ 
stant  of  50  steps.  On  the  other  hand,  for  Ru,(0)  =  0.5,  the 
second  pole  is  located  at  0.33  and  is  therefore  much  faster 
than  the  pole  at  A  =  0.9.  The  threshold  of  excitation  such 
that  both  poles  are  located  at  A  =  0.9  is  iZui(O)  =  0.027. 
The  responses  exhibit  precisely  the  expected  convergence 
rates,  with  a  time  constant  of  approximately  10  steps  at 
t  =  200  and  50  steps  at  <  =  500. 

Figure  2  shows  the  responses  for  the  covariance  ma¬ 
trix  and  its  averaged  equivalent  (both  matrices  are  scalars 
here).  -A.gain,  the  averaging  approximation  is  a  good  pre¬ 
dictor  of  the  actual  system’s  response.  When  the  excita¬ 
tion  diminishes  at  t  =  400,  the  covariance  matrix  increases, 
but  only  up  to  appro.ximately  0.4,  which  is  close  to  the  up¬ 
per  bound  equal  to  1/0  =  1/2.45. 

Figures  3  and  4  give  the  responses  for  or  =  0.  This 
is  the  usual  least-squares  algorithm  with  forgetting  factor. 


Figure  5;  Parameter  Response  -  Original  LS 

One  finds  that,  once  excitation  diminishes,  the  parameter 
variations  due  to  the  noise  become  very  large.  This  is  due, 
in  part,  to  the  large  values  of  the  P  matrix  reaches  in  that 
case.  These  large  parameter  fluctuations  are  avoided  in 
the  modified  algorithm  because  of  the  limits  placed  on  the 
matri.x  P  through  the  parameter  o-.  In  addition,  increased 
filtering  is  provided  in  the  modified  algorithm  through  the 
change  from  a  first-order  update  law  to  a  second-order 
update  law. 

Figures  5  and  6  shows  the  response  of  the  least-squares 
algorithm  with  forgetting  factor  A  =  0.97.  This  choice 
of  forgetting  factor  leads  to  a  smoothing  of  the  parame¬ 
ter  fluctuations  in  the  second  part  of  the  response,  when 
the  e.xcitation  is  reduced.  However,  in  the  earlier  part 
of  the  response  around  i  =  200,  the  convergence  time  is 
longer  than  for  the  modified  algorithm.  In  the  original 
algorithm,  there  is  little  flexibility  to  deal  with  varying 
excitation  conditions.  A  single  choice  of  forgetting  factor 
must  satisfy  ail  conditions.  The  modified  algorithm  gives 
more  fle.xibility  and  relaxes  certain  trade-offs.  Note  that, 
in  Figure  5,  the  fluctuations  of  the  parameter  responses 
appear  to  be  of  higher  frequency  than  in  Figure  1.  This 


Figure  G;  Covariance  Matrix  Response  -  Original  LS 

IS  again  atinbutcd  to  the  difference  between  a  first-order 

and  second-order  update  law. 
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Abstract:  The  application  of  multivariable  adaptive  con¬ 
trol  techniques  to  flight  control  reconfiguration  is  consid- 
ered  The  paper  first  discusses  three  adaptation  mecha- 
nisms  for  model  reference  control.  It  is  shown  that  sim¬ 
ple  algorithms  can  be  obtained  if  full  state  feedback  is 
assumed.  The  respective  advantages  and  disadvantages  of 
the  three  algorithms  are  discussed  in  general  terms,  con¬ 
sidering  their  complexity  and  the  assumptions  that  they 
require.  Next,  the  application  of  the  adaptive  algorithms 
to  reconfigurable  flight  control  is  investigated.  The  objec¬ 
tive  is  to  design  automatically  a  flight  control  law  in  the 
presence  of  actuator  failures  or  surface  damage.  Design 
considerations  for  the  adaptive  algorithms  are  discussed 
in  this  context.  Simulations  obtained  using  a  full  nonlin¬ 
ear  simulation  of  a  twin-engine  jet  aircraft  are  included  to 
illustrate  the  results. 


1  Introduction 

Reconfiguration  is  likely  to  be  a  feature  of  future  genera¬ 
tions  of  flight  control  systems.  The  main  motivation  for  re¬ 
configuration  is  greater  survivability,  attained  through  the 
ability  of  the  feedback  system  to  reorganize  itself  m  the 
presence  of  actuator  failures  and  surface  damage.  High- 
performance  aircrafts  are  often  unstable  to  the  point  of 
exceeding  the  control  capabilities  of  a  human  pilot.  Insta¬ 
bility  follows  either  from  requirements  of  maneuverabil¬ 
ity,  or  from  efficiency  considerations,  and  it  is  expected 
to  become  an  increasingly  common  characteristic  of  future 
aircrafts.  The  need  for  fault  tolerant  control  methods  is 
therefore  critical.  The  benefits  of  reconfigurable  flight  con¬ 
trol  systems  extend  beyond  the  immediate  considerations 
of  safety.  Indeed,  reconfigurable  systems  reduce  the  need 
for  other  forms  of  reliability,  such  as  redundant  actuators. 
Therefore,  increased  maintainability  and  reduced  costs  are 
gxpected  to  result  from  this  technology  [3]. 

In  the  past,  the  Air  Force  has  supported  a  major 
RfcD  program  through  the  Self- Repairing  Flight  Control 
System  (SRFCS)  program,  administered  through  Wnght- 
Patterson  Air  Force  Base.  A  successful  flight  test  marked 
the  end  of  the  first  phase  of  this  program  [10].  The  ap¬ 
proach  followed  in  this  program  was  based  on  the  concept 
of  failure  detection  and  identification.  The  system  con¬ 
sisted  in  a  fast  and  efficient  method  to  detect  the  failure 
among  a  set  of  pre-planned  conditions,  and  in  procedures 
to  handle  each  of  the  cases.  This  approach  works  weU 
in  restricted  cases,  but  suffers  from  significant  drawbacks. 
The  first  is  that,  as  the  number  of  failures  grows,  it  be 
comes  increasingly  difficult  and  time-consuming  to  carry 
out  the  detection  and  classification.  Further,  there  is  no 
reason  to  believe  that  a  failure  that  has  not  been  catego¬ 
rized  will  not  cause  the  whole  system  to  fail.  In  the  c^e  of 
flight  control,  there  are  multiple  possible  actuator  failures 
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(multiple  actuators  and  multiple  failure  modes,  such  as 
locked  or  floating)  and  an  infinite  variety  of  possible  sur¬ 
face  damages.  In  addition,  because  failure  detection  rehes 
on  models  of  the  unfailed  system,  any  discrepancy  between 
the  model  and  reality  can  lead  to  false  detection.  Because 
of  the  nonlinearity  and  complexity  of  aircraft  dynarnics 
(especially  engine  dynamics  and  aerodynamics),  this  is  a 
nontrivial  problem  in  reconfigurable  flight  control. 

A  totally  different  approach  to  the  problem  of  flight 
control  reconfiguration  consists  in  identifying  the  dynamic 
behavior  of  the  aircraft  in  real-time,  and  in  designing  a 
controller  automatically.  Because  such  an  approach  does 
not  rely  on  failure  classification,  it  is  expected  that  the 
resulting  system  will  tolerate  a  larger  class  of  failures,  in¬ 
cluding  some  that  may  not  have  been  anticipated.  In  this 
paper,  we  discuss  several  multivari.ablc  adaptive  control 
algorithms  that  may  be  used  with  that  objective  in  mind. 
We  make  assumptions  that  are  realistic  in  the  flight  control 
problem,  yet  allow  to  considerably  simplify  the  algorithms 
available  in  the  literature.  We  also  present  the  results  of 
a  simulation  study  using  a  detailed  nonlinear  model  of  a 
twin-engine  aircraft. 

2  Reconfigurable  Flight  Control 
2.1  Aircraft  Model 

The  dynamics  of  aircrafts  can  be  accurately  represented  by 
nonlinear  differential  equations  models.  For  the  purpose 
of  flight  control,  hnearized  models  are  typically  used,  of 
the  form 

i  =  Ax  -h  Bu  y  =  Cx  (1) 

The  components  of  the  state  x  are  divided  into  the  longi¬ 
tudinal  variables:  a  (angle  of  attack),  q  (pitch  rate),  h  (al¬ 
titude),  r  (velocity),  and  the  lateral  variables:  P  (sideslip), 
p  (roll  rate),  r  (yaw  rate),  fr  (roll  angle),  i/>  (yaw  angle). 
The  control  inputs  are  also  separated  into  the  longitudinal 
control  variables:  6e  (elevator  command)  and  6t  (thrust 
command),  and  the  lateral  control  variables:  8a  (afieron 
command)  and  Sr  (rudder  command).  For  the  design  of 
stability  augmentation  systems,  the  state  vector  can  be 
reduced  to  the  five  states  or,  q,  P,  p,  and  r  with  good  ap¬ 
proximation.  The  fast  control  variables  are  also  reduced  to 
6b,  8a,  8r.  We  will  use  such  a  model  in  this  paper  (except 
for  simulations). 

The  control  outputs  y  can  be  defined  in  several  ways. 
For  low  dynamic  pressure  and  limited  angle  of  attack,  the 
variables  q,  p,  r  are  good  choices  of  output  variables  to 
be  regulated  [cf.  [7]).  The  control  problem  is  then  that 
of  a  linear-time  invariant  system  with  three  inputs  and 
three  outputs.  Because  of  the  symmetry  of  the  (unfailed) 
Eurcraft,  the  motions  in  the  longitudinal  and  lateral  axes 
can  be  decoupled  in  general,  so  that  the  problem  is  actually 
reduced  to  a  single-input  single-output  control  problem  in 
the  longitudinrd  axis,  and  a  2x2  multivariable  problem  m 
the  lateral  axis. 
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2.2  Effects  of  Failures 

After  a  failure,  a  model  similar  to  (1)  can  still  be  used  for 
control  system  design.  There  are,  however,  two  differences. 
First,  the  dynamics  are  generally  not  decoupled  after  a 
failure  because  symmetry  is  lost  (both  ailerons  will  not 
fail  together).  In  addition,  a  failure  is  likely  to  alter  the 
equilibrium  of  the  aircraft.  The  constant  values  added  to 
the  controls  (trim  biases)  then  have  to  be  modified  to  reach 
a  new  equilibrium.  We  account  for  this  equilibrium  change 

by  introducing  a  state-space  model 

i  =  Ax  +  Bu  +  d  y  —  Cx  (2) 

where  the  constant  disturbance  d  represents  the  bias  forces 
introduced  by  the  failure  or  damage.  The  lack  of  symmetry 
implies  that  the  matrices  A  and  B  will  not  necessarily  be 
block-diagonal  anymore. 

2.3  Challeirges  of  Control  Reconfiguration 
There  are  several  challenges  in  solving  the  flight  control 
reconfiguration  problem; 

•  it  is  a  multivariable  problem,  with  strong  cross-couplings 
appearing  after  failures; 

•  it  is  a  problem  suited  for  adaptation,  not  only  be¬ 
cause  the  dynamic  model  after  failure  is  unknown  but  also 
changes  with  flight  condition; 

•  modern  high-performance  aircrafts  are  unstable,  often 
leaving  little  time  for  reconfiguration; 

•  actuator  authority  is  limited  and  sensor  noise  is  signifi¬ 
cant  (high-gain  feedback  is  not  an  option); 

•  the  reconfiguration  strategy  must  be  robust  to  linear  and 
nonlinear  unmodelled  effects; 

•  the  number  of  possible  failures  is  large. 

3  Multivariable  Adaptive  Algorithms 

In  this  section,  we  discuss  some  adaptive  algorithms  that 
are  relevant  to  the  problem  of  flight  control  reconfigu¬ 
ration.  The  control  objective  that  we  consider  is  based 
on  the  well-known  model  reference  formulation.  Multi- 
variable  model  reference  adaptive  control  algorithms  are 
available  in  the  literature  (see  [8],  [9],  and  the  references 
therein).  In  the  present  paper,  we  discuss  slightly  differ¬ 
ent  algorithms.  We  incorporate  the  disturbance  d  and  we 
also  show  that  simplifications  can  be  obtained  under  the 
assumption  of  full  state  measurement.  In  that  case,  the 
"observer”  part  of  the  controllers  can  be  eliminated  and  a 
state-feedback  control  law  can  be  obtained. 

3.1  Assumptions  and  Reference  Model 
We  consider  the  state-space  model  for  the  plant  (2),  where 
r  6  R”,  u  €  jR"*,  y  €  R"',  and  d  £  R" .  We  assume  that 
the  whole  state  x  is  available  for  measurement,  although 
only  the  output  y  is  to  be  tracked.  The  objective  is  for  y 
to  match  the  output  yu  of  a  reference  model 

i/M  =  Am  VM  +  Bm  r  (3) 

where  vm  €  R"*  and  r  €  R"' ■  The  matrices  Am  and 
Bm  are  arbitrary  square  matrices  with  Am  stable.  For 
the  model  reference  control  problem  to  have  a  relatively 
simple  solution,  we  assume: 

Assumption  1  The  plant  has  relative  degree  1,  i.e., 
det(C'R)  /  0. 

Assumption  2  The  plant  transfer  function  is  minimum 
phase,  i.e.,  the  zeros  of  transmission  of  the  system  are  in 
the  open  left-half  plane. 

The  first  assumption  guarantees  that  the  closed-loop 
transfer  function  of  the  plant  can  be  made  to  match  the 
transfer  function  of  the  reference  model  (3)  using  a  proper 
compensator.  If  the  assumption  is  not  satisfied,  the  model 
reference  control  problem  may  still  be  solvable,  but  a  more 
complex  reference  model  would  have  to  be  chosen,  so  as 
to  match  the  so-called  Hermite  normal  form  of  the  plant. 


When  Assumption  1  is  satisfied,  this  Hermite  form  is  sim¬ 
ply  H{s)  =  diag  {l/s}  which  means  that  the  behavior  of 
the  plant  at  infinity  is  that  of  a  multivariable  integrator. 
The  matrix  CB  is  called  the  high-frequency  gain  matrix  of 
the  plant  and  is  usually  denoted  Rp  in  the  adaptive  con¬ 
trol  literature.  It  is  a  most  critical  parameter  for  adaptive 
control  algorithms. 

The  second  assumption  is  a  necessary  assumption  to 
guarantee  the  internal  stability  of  the  model  reference  con¬ 
trol  algorithm.  The  dimension  of  the  state-space  for  the 
reference  model  is  m,  while  the  dimension  of  the  state- 
space  for  the  plant  is  n.  Therefore,  n  —  m  modes  must 
be  made  unobservable  “or  uncontrollable.  It  can  be  shown 
that  the  model  reference  control  law  places  m  modes  of  the 
plant  at  the  desired  model  reference  locations,  and  makes 
the  others  unobservable  by  placing  them  at  the  locations 
of  the  transmission  zeros. 

3.2  Model  Reference  Control  Law 
We  consider  the  state-feedback  control  law 

u  r=  Co  r  Gqx  -{-  n  (4) 

where  Co  £  R”'*’",  Go  £  R'"'”,  v  £  R"'  are  free  controller 
parameters.  The  closed-loop  dynamics  are  given  by 

X  —  Ax  -f  BCqt  -f-  DGqX  -h  Bv  -f-  d  y  =  Ct  (5) 

or,  in  terms  of  the  output  y, 

y  =  {GA  -{-CBGo)x  aCBCot  +  CDv  ~  Cd  (6) 

(6)  leads  to  the  same  input/output  relationship  as  that  of 
the  reference  model  (3)  for  the  so-called  nominal  value.s  of 
the  controller  parameters 

Co*  =  (GB)-‘  Bm 

Go  =  (GB)-' (AmC -C.4)  (7) 

V*  =  -(CR)-'  (Cd) 

If  the  plant  wars  known,  these  would  be  the  controller  pa¬ 
rameters  that  one  would  use  to  achieve  the  model  reference 
control  objective. 

3.3  Indirect  Adaptive  Control 

An  indirect  adaptive  controller  can  easily  be  derived  using 

(7) .  Let  A,  B,  d  be  estimates  of  the  plant  parameters  A, 
B,  d.  Then,  an  error  can  be  defined  by 

ei  =  Ax  -f-  Bu  +  d  —  X  (8) 

where  ei  is  an  error  vector  which,  under  the  assumptions, 
is  equal  to 

ei  =  [A  —  a)  X  —  {^B  —  B')  u  {d  —  d)  (9) 

In  practice,  a  least-squares  algorithm  can  be  used  to  find 
the  estimates  of  A,  B,  and  d  which  minimize  the  sum  of 
squares  of  ei  evaluated  at  several  sampling  instants.  Each 
row  of  (8)  is  treated  independently  in  this  process.  Such  an 
algorithm  requires  the  measurement  of  the  derivative  of  i. 
In  practice,  the  derivative  can  often  be  obtained  by  filtered 
differentiation  of  x  (see  [l]  for  example).  In  the  context  of 
flight  control,  such  differentiation  is  not  necessary  because 
of  the  availability  of  direct  acceleration  measurements. 

The  complete  adaptive  control  algorithm  is  obtained 
by  setting  the  controller  parameters  Co,  Go,  and  d  in  (4) 
using  (7),  with  the  estimates  of  the  plant  parameters  A,  B, 
d  replacing  the  true  plant  parameters  A,  B,  and  d.  Aside 
from  the  issue  of  stability,  a  major  question  to  be  resolved 
is  what  to  do  when  CB  is  singular,  since  this  matrix  is 
to  be  inverted.  Few  methods  are  available  to  address  this 
problem  satisfactorily,  and  it  is  still  a  subject  of  current 
research . 
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3.4  Direct  Adaptive  Control  —  Output  Error 
A  totally  differeat  algorithm  can  be  obtained,  based  on 
known  properties  of  gradient  algorithms  (c/.  [8]).  The  so- 
called  output  error  co  =  y  -  VM  is  used  for  that  purpose. 
Because  the  system  is  assumed  to  have  relative  degree  1,  a 
simple  algorithm  can  be  derived  (there  is  no  need  for  the 
so-called  augmented  error).  We  use  the  following  facts. 
Fact  1  The  output  error  eo  satisfies 

eo  =  {si  —  Am)~^  (C!B)[{Co  —  Co)r 

+  (Go -g;)  I +  («-«*)] 

Proof:  Dedne  XM  =  Amxm  A  BCgr  +  BGqXm  +  Bv  +d, 
so  that  yM  =  OxM  tor  some  appropriate  choice  of  initial 
conditions  for  xm-  Since  eo  =  Cx—yu  =  Cx  —  AmCxst  — 
bur, we  have,  using(7) 

io  =  (CA  +  {CB)  GS)  (x  -  xm)  +  {CB)  [{Co  -  Co)r 
+  (Go  -  Go*)  X +  («-«*)] 

which  leads  to  (10). 

A  compact  form  of  equation  (10)  is 

eo={sI-AM)-^  (CB)[<I-.ui] 

where  the  matrix 

=  ((Co  -  Co*) .  (Go  -  Go) .  (ti  -  «•)) 
is  the  mx(m  -f  n  -f  1)  matrix  of  parameter  errors,  and  the 
vector  =  (r^,  r^,  l)  is  the  so-called  regressor  vector  o{ 
dimension  m  -f  n  -f  1.  The  product  is  carried  out  in 
the  time  domain,  and  the  resulting  signal  is  applied  to  the 
linear  time-invariant  operator  {si  —  Am)  {CD). 

Fact  2  The  update  law  ^  =  -  Geow^  leads  to  an  adap¬ 
tive  system  that  is  stable  in  the  sense  of  Lyapunov,  with 
the  properly  that  eo  tends  to  zero  as  t  —  oo,  provided 
that  {si  -  Axf)"'  is  a  strictly  positive  real  transfer  func¬ 
tion  matrix,  and  {CB)^  G  '  is  a  positive  definite  matrix. 


(11) 

(12) 

(13) 


Proof;  foUows  from  the  Kalman-Yacubovich- Popov 
lemma.  See,  e.g.,  [8]. 

The  algorithm  given  here  is  similar  to  the  one  available 
in  [8],  although  with  some  nontrivial  adjustments.  The 
main  differences  are  the  simplifications  resulting  from  state 
measurements  and  the  constant  disturbance.  SPR 

condition  can  be  satisfied  by  choosing  Am  +  Am  negative 
definite.  The  other  condition  requires  prior  knowledge  of 
the  matrix  CB,  which  is  the  high-frequency  matrix  men¬ 
tioned  earlier,  and  a  choice  of  the  adaptation  gain  matrix 
G  to  enforce  the  condition. 

3.5  Direct  Adaptive  Control  -  Input  Error 
Another  direct  adaptive  aJgorithm  can  be  obtained 
through  the  use  of  the  following  fact. 

Fact  3  The  following  identity  is  satisfied  for  all  lime 

u  —  CqBJ^  {y  —  Amv)  +  GqX  -1-  u  (14) 

Proof:  since  y  —  Am  y  =  {CA  —  AmC)  x  -f  {CB)  u  Cd, 
using  the  nominal  parameter  values  (7),  we  find 

y  —  Am  y  =  {CB)  (— Gqx  -(-  u  -  u  )  (15) 


and  (14)  follows. 

The  identity  (14)  leads  to  a  new  error  equation 

02  =  CoBf}^  {y  —  Amv)  A- Gox -tr  V  —  u  (16) 

Under  the  assumptions,  we  have  that 

e2  =  <&-z  (!') 

where  <1>  is  the  (controller)  parameter  error  defined  in  (13) 
and  z  is  a  new  regressor  vector  defined  to  be 


z  = 


-  ^sty) 


(18) 


The  main  difference  between  the  error  equation  (17)  and 
(8)  is  the  absence  of  the  transfer  function  between  the  pa¬ 
rameter  error  and  the  error  signal,  eliminating  conditions 
necessary  for  the  stability  of  the  adaptive  algorithm.  Fur¬ 
ther,  it  makes  possible  the  use  of  least-squares  algorithms, 
either  in  batch  or  recursive  forms. 
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Figure  1:  Parameters  for  the  UnfaLled  Aircraft 

3.6  Comparison  of  Adaptive  Algorithms 

The  three  algorithms  presented  here  all  achieve  the  same 
model  reference  control  objective,  although  through  quite 
different  structures  and  making  somewhat  different  as¬ 
sumptions.  The  following  elements  are  worth  considering: 

•  Number  of  parameters:  both  direct  methods  adjust 

the  same  number  of  parameters,  which  are  the  elements 
of  Co,  Go,  and  v.  The  total  number  is:  -t-  m.n  -f  m. 

In  the  case  of  the  indirect  algorithm,  the  parameters  that 
are  identified  are  the  plant  parameters,  and  there  are  n  -|- 
n.m-bn  of  them.  Since  m  <  n,  the  direct  methods  require 
less  parameters.  For  the  case  of  aircraft  control  with  a 
standard  reduced-order  model  (n  =  5,  m  =  3),  the  number 
of  parameters  is  45  in  the  case  of  the  indirect  algorithm, 
and  27  in  the  direct  case,  a  substantial  reduction. 

•  Prior  information:  the  direct  output  error  algo¬ 
rithm  requires  the  most  restrictive  assumptions,  by  im-^ 
posing  positive  definiteness  conditions  on  the  product  of 
the  inverse  of  the  adaptation  gain  matrix  with  the  high- 
frequency  gain  matrix.  Since  there  is  no  obvious  way  to  en¬ 
force  such  a  condition,  the  condition  practically  becomes  a 
sj'minetry  and  positive  definiteness  condition  on  the  high- 
frequency  gain  matrix  itself.  Guaranteeing  that  such  a 
condition  is  satisfied  is  all  but  trivial.  The  direct  input 
error  and  indirect  algorithms  also  require  conditions  on 
the  high-frequency  gain  matrix  to  be  implementable,  but 
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Figure  2:  Unfailed  Aircraft  Step  Responses.  From  top  to 
bottom:  pitcir  rate  response  to  pitch  command,  roll  rate 
response  to  roll  command,  and  yaw  rate  rcspon.se  to  yaw 
command, 

the  conditions  are  weaker.  In  the  case  of  the  indirect  algo¬ 
rithm,  the  estimate  of  the  high-frequency  gain  matrix  must 
be  nonsingular  at  all  times  for  the  control  law  to  be  imple- 
mentable.  For  the  direct  input  error  algorithm,  a  proof  of 
stability  reveals  the  necessity  for  the  adaptive  parameter 
Co  (as  opposed  to  the  estimate  of  CB  in  the  indirect  case) 
to  remain  nonsingular  at  all  times.  In  [6],  it  was  shown 
that  a  modification  based  on  a  sort  of  hysteresis  could  be 
incorporated  in  the  algorithm  with  a  least-squares  update 
to  guarantee  stability  of  the  overall  adaptive  system.  The 
condition  on  the  high-frequency  gain  matrix  was  only  that 
an  upper  bound  of  its  norm  must  be  known. 

•  Adaptation  algorithms:  with  the  direct  input  error 
algorithm  and  with  the  indirect  algorithm,  one  may  use  not 
only  recursive  gradient  algorithms,  but  also  least-squares 
rJgorithms  which  are  faster  and  more  efficient.  Another 
advantage  is  that  the  least-squares  algorithms  can  be  used 
in  their  batch  forms,  which  allow  for  monitoring  of  the 
quality  of  the  estimation  procedure  [l]. 

•  Flexibility  of  control  algorithm:  only  the  indirect 
approach  allows  for  the  use  of  alternate  control  strate¬ 
gies.  One  worthwhile  option  is  the  use  of  optimization 
algorithms  which  account  for  actuator  saturation  [5].  The 
direct  algorithms  rely  on  the  model  reference  formulation 
and  nontrivial  modifications  would  have  to  be  devised  to 
account  for  actuator  saturation.  Another  advantage  of  the 
indirect  algorithm  is  also  that  it  affords  the  opportunity  to 
incorporate  prior  information  about  the  plant  parameters 
in  the  estimation  procedure  [4]. 

4  Implementation 

4.1  Aircraft  Model  and  Assumptions,  and  Design 
Considerations 

Simulations  were  carried  out  using  a  detailed  simulation 
of  a  twin-engine  aircraft,  developed  at  NASA-Dryden  [2], 
The  model  is  a  complete  nonlinear  aircraft  simulation,  in¬ 
cluding  full  envelope  aerodynamics,  atmospheric  model. 


Figure  3:  Failed  Aircraft  Step  Responses  with  Original 
Controller.  From  top  to  bottom;  pitch  rate  response  to 
pitch  command,  roll  rate  response  to  pitch  command,  pitch 
rate  response  to  roll  command,  roU  rate  response  to  roll 
command. 

detailed  engine  dynamics,  and  actuator  dynamics.  The 
reconfigurable  control  system  design,  on  the  other  hand, 
is  based  on  the  reduced-order  model  using  the  five  states 
a,?,  p,  r.  The  control  inputs  are  denoted  Sh,  5/i,and  Sr. 
There  is  a  cross-feed  between  aileron  command  and  eleva¬ 
tor  command.  Specifically,  the  elevator  command  is  the 
sura  of  the  symmetric  deflection  Sh  a-ud  an  antisymmetric 
deflection  set  to  For  the  reconfigurable  control  law, 

there  is  nothing  that  forces  the  same  reduction  of  the  five 
independent  control  surfaces  to  three  control  inputs  (i.e., 
it  is  not  neccessary  to  keep  the  same  coupling  matrix). 
However,  it  is  convenient  to  keep  the  same  structure  for 
compatibility  with  the  originad  control  law,  and  for  other 
considerations.  While  treating  the  five  control  surfaces  as 
independent  might  lead  to  more  flexibility  in  the  recon¬ 
figurable  control  design,  it  would  also  require  that  these 
surfaces  be  actuated  independently  for  the  parameters  of 
the  B  matrix  to  be  identifiable. 

The  controlled  outputs  are  chosen  to  be  q,  p,  r.  It  was 
checked  that  this  choice  corresponded  to  minimum  phase 
zeros  for  the  flight  condition  under  consideration.  In  gen¬ 
eral,  it  is  possible  to  enforce  minimum  phase  properties  by 
replacing  q  and  r  by  9  -f  K^cr  and  r  —  KpP.  A  justification 
is  that  one  has,  approximately,  a  =  q  and  /5  =  — r  (al¬ 
though  the  precise  location  of  the  transmission  zeros  must 
be  calculated  with  the  coefficients  of  the  A  and  B  matri- 
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Figure  4;  Parameters  for  the  Failed  Aircraft 
ces).  The  choice  of  q,  p,  and  r  as  tracked  outputs  leads  to 
a  CB  matrix  that  is  a  9x9  matrix  whose  elements  ace  the  3 
moments  created  by  each  of  the  3  control  inputs.  As  long 
as  the  three  vector^  of  moments  are  linearly  independent 
(t.e.,  moments  in  all  three  directions  can  be  independently 
created),  the  matrix  CB  is  nonsingular,  so  that  the  as¬ 
sumptions  under  which  the  algorithms  were  derived  are 
satisfied. 

4.2  Experiments 

The  feasibility  of  the  model  reference  control  law  was  eval¬ 
uated  in  experiments,  using  a  diagonal  reference  model 
with  elements  ,  following  [7].  For  the  purposes  of 

this  paper,  we  simplified  the  problem  by  assuming  that 
identification  was  performed  before  reconfiguration,  using 
fairly  good  data  segments.  Work  is  in  progress  to  study 
the  complete  problem,  and  the  interaction  between  iden¬ 
tification  and  control  reconfiguration. 

The  aircraft  parameters  were  identified  using  data  seg¬ 
ments  of  10  seconds  with  rich  excitation,  and  a  batch  least- 
squares  algorithm.  A  standard  flight  condition  at  9,800  ft 
and  Mach  0.5  was  chosen.  The  responses  were  obtained 
bv  having  the  aircraft  under  the  control  of  a  simple  PID 


regulator  provided  with  the  simulation  package.  Multiple 
steps  of  reference  inputs  were  injected  to  provide  excita¬ 
tion.  The  results  are  shown  in  Fig.  1.  The  angles  are 
expressed  in  degrees  and  the  angular  rates  in  degrees  per 
second.  The  elements  of  the  matrices  conform  to  expecta¬ 
tions.  For  example,  the  A  matrix  has  elements  (1,2)  close 
to  1  and  (3,5)  close  to  -1,  with  a  large  (4,3)  element  (effec¬ 
tive  dihedral).  The  B  matrix  is  close  to  diagonal,  which 
is  to  be  expected  for  the  unfailed  aircraft.  The  values  of 
the  controller  matrices  were  found  using  the  relationships 
(7).  The  direct  identification  procedure  based  on  the  input 
error  gave  similar  results,  which  are  not  shown. 

Fig.  2  shows  the  step  responses  obtained  with  the 
model  reference  controller.  A  step  was  applied  in  the  pitch 
axis,  with  a  zero  reference  for  the  roll  and  yaw  axes.  Then, 
the  experiment  was  repeated  in  a  similar  manner  for  the 
roil  and  the  yaw  axes.  The  first  plot  shows  the  pitch  rate 
response  to  a  step  in  pitch  command.  The  dashed  line 
is  the  aircraft  response,  while  the  solid  line  is  the  refer¬ 
ence  model  response.  In  all  three  cases,  the  responses  are 
found  to  be  very  satisfactory.  The  cross-axis  responses 
were  found  to  be  small  and  are  not  reproduced,  due  to 
lack  of  space. 

An  interesting  observation  concerns  the  trim  of  the  air¬ 
craft.  During  the  identification,  the  aircraft  was  found  to 
be  trimmed  by  the  PID  controller  at  an  angle  of  attack 
of  about  4.6  degrees,  requiring  a  command  5//  of  approx¬ 
imately  -2.86  degrees.  The  identification  procedure  did 
not  identify  the  trim  value  for  6ii  or  a,  but  only  the  con¬ 
stant  disturbance  vector  v.  However,  the  trim  value  for  6// 
can  be  calculated  for  the  previous  angle  of  attack  to  be 
u(l)  H-  Go(l,  1)  o-irim  which  is  equal  to  0.02  —  0.64  »  4.6  = 
-2.92  and  is  remarkably  close  to  the  value  obtained  by 
the  PID  regulator.  In  other  words,  the  least-squares  pro¬ 
cedure  is  successful  in  determining  the  trim  value  required 
to  maintain  level  flight,  without  knowledge  of  what  that 
flight  condition  actually  is. 

Fig.  3  shows  the  response  of  the  aircraft  after  a  failure 
where  the  left  horizontal  tail  surface  was  stuck  at  its  value 
for  i  =  305  seconds.  The  responses  show  significant  degra¬ 
dation.  Pitch  response  is  much  smaller  than  desired,  and 
a  significant  roll  rate  response  is  incurred  with  the  pitch 
command.  These  two  effects  ate  to  be  expected  with  the 
loss  of  about  half  the  torque  in  the  pitch  axis  and  the  loss 
of  symmetry  leading  to  a  toll  moment.  Conversely,  there 
is  a  small  pitch  rate  response  to  a  step  in  roll  command 
and  a  reduced  response  in  roll  rate.  These  two  effects  can 
be  explained  by  the  cross-feed  from  roll  command  to  dif¬ 
ferential  elevator  discussed  earher.  The  cross-feed  leads  to 
a  pitching  moment,  and  a  partial  loss  of  rolling  moment  is 
also  incurred. 

Fig.  4  gives  the  parameters  identified  for  the  failed  air¬ 
craft.  The  same  PID  controller  was  used  to  obtain  the 
data,  although  this  controller  was  not  able  to  maintain 
level  flight  for  extended  periods  of  time.  The  B  matrix  ex¬ 
hibits  expected  changes.  The  pitching  effectiveness  of  the 
tail  is  reduced  by  a  factor  of  2,  from  -6.89  to  -3.51.  A  large 
rolling  moment  of  -7.15  rJso  appears  due  to  the  dissymet- 
ric  failure.  The  rolling  moment  effectiveness  of  the  lateral 
channel  is  reduced  from  18.72  to  13.08  due  to  the  loss  of 
effectiveness  of  the  cross-feed  to  the  elevator.  A  pitching 
moment  of  2.30  also  appears  for  the  same  reason.  Note 
that  the  trim  components  d  and  v  are  significantly  differ¬ 
ent  from  the  values  before  the  failure.  This  highlights  the 
fact  that,  even  if  the  aircraft  remains  stable  after  a  fail¬ 
ure,  keeping  the  previous  flight  condition  requires  a  new 
set  of  trim  values  that  must  be  identified  by  the  adaptive 
algorithm  if  the  aircraft  is  to  be  prevented  from  rapidly 
diverging. 

The  first  four  plots  of  Fig.  5  show  the  responses  of  the 
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Figure  5:  Failed  Aircraft  Step  Responses  with  Reconfig¬ 
ured  Controller.  From  top  to  bottom:  pitch  rate  response 
to  pitch  command,  roll  rate  response  to  pitch  command, 
elevator  response  to  pitch  command,  aileron  response  to 
pitch  command,  pitch  rate  response  to  roll  command,  roll 
rate  response  to  roll  command. 


aircraft  with  the  reconfigured  control  law  to  steps  in  the 
pitch  command.  The  pitch  rate  response  is  found  to  be  re¬ 
stored  and  the  cross-response  in  the  roll  axis  is  found  to  be 
significantly  reduced.  The  elevator  and  aileron  commands 
are  found  to  be  remarkably  similar.  Trim  value  aside,  one 
would  expect  Sa  =  0.58  6h  for  the  -7.15  and  13.08  terms  in 
the  B  matrix  to  cancel  each  other.  This  is  indeed  what  is 
observed.  The  last  two  plots  of  Fig.  5  show  the  responses 
to  a  roll  command.  Similarly  to  the  results  in  the  pitch 

axis,  the  cross-coupling  in  pitch  is  reduced  and  the  roll  re¬ 
sponse  is  improved.  Here,  one  would  expect  Sh  =  0.66  6a, 
for  the  terms  2.30  and  -3.51  in  the  B  matrix  to  cancel. 
Plots  for  these  variables  are  omitted  for  lack  of  space,  but 
the  relationship  indeed  holds. 
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Abstract 


The  problem  of  reconfigurable  flight  control  is  investigated,  focusing  on  model  reference  adapti  e 
control  and  leaat-scuares  identification  algorithnts.  Tl.e  issue  of  parametriaation  is  investigated, 
considering  both  direct  and  indirect  algorithms.  Different  least-squares  adaptation  procedures  are 


discussed.  Results  for  the  implementation  of  an  input  error  direct  adaptive  control  algorithm 
rouetlrer  rvi.h  a  detailed  aircraft  mode,  are  presented.  An  emphasis  of  the  discussion  is  on  the 


multivariable  nature  of  the  adaptive  coatiol 
appear  after  failures,  and  that  the  adaptive 
rotational  axes.  Tracking  of  the  commands  a 
achieved. 


problem.  It  is  shown  that  strong  cross-couplings 
algorithm  is  able  to  restore  decoupling  between  the 
.ml  steady  hight  at  the  pre-failure  condition  are  also 


1  Introduction 

ReconSgurabic  flight  control  has  been  a  topic  of  research  for  several  years.  The  goal  is  to  create 
control  systems  that  can  compensate  for  farlures  and  damages  to  aircraft.  Such  systems  will  in¬ 
crease  the  survivability  of  aircrafts,  r.c.  them  ability  to  sustain  damage  while  maln.aming  sufficient 
maneuverabiffty.  Reconfigurable  flight  control  systems  should  also  help  to  reduce  costs.  Indeed, 
redundant  systems  are  currently  used  to  provide  bach-ups  for  failed  systems.  Reconfigurable  flight 
control  systems  may  reduce  the  need  for  redundancy,  and  therefore  lower  aircraft  costs. 

Reconfigurable  flight  control  systems  compensate  for  failures  using  the  unfailcd  portions  of 
the  control  surfaces.  If.  for  example,  one  of  the  airplane’s  elevators  becomes  jammed,  the  commands 
to  the  other  elevator,  as  well  as  to  tile  a, Icons  and  to  the  rudder,  are  modified  to  compensate  for 
that  failure.  The  approach  discussed  in  the  thesis  tor  the  design  of  a  reconfigurable  control  system 
is  adaptive  control.  A  simple  view  of  an  adaptive  controller  is  the  combination  of  a  non-adaptive 
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estimate  unknown  parameters.  In  indirect  adaptive  control,  these  parameters  may  be  the  matrn 

ces  of  a  state-space  representation,  or  the  numerator  and  denominator  polynomial  of  a  transfer 

function  representation.  These  estimates  are  used  to  select  feedback  gains  to  control  the  plant.  In 

contrast,  in  direct  adaptive  control,  the  identifier  estimates  the  feedback  gains  themselves.  Smce 

techniques  to  solve  systems  of  linear  equations, 

hetlier  the  problem  can  be  cast  in 


efficient  algorithms  are  available  from  least-squares 

an  important  issue, 
the  form  of  a  set  of  Unear  equations. 


for  both  direct  and  indirect  approaches,  is  wl 


An  adaptive  algorithnr  has  notable  advantages  over  other  methods.  Failure  .dent.fication 
methods  are  limited  by  what  failures  the  system  was  des.gned  to  compensate  tor.  If  a  failure  oc 
curs  which  is  not  one  of  those  that  had  been  anticipated,  the  system  will  not  be  able  to  rdent.ty 
the  failure  correctly,  and  the  controUer  wiU  no.  be  reconfigured  properly.  Another  advantage  of 
adaptive  methods  is  that  they  aUow  for  reconfiguration  not  only  for  a  broad  range  of  failures,  but 

also  for  varying  flight  conditions. 


1.1  Reconfigurable  Flight  Control  Systems 

1.1.1  Failure  Detection  and  Classification 


In  1984,  the  U.S.  Air  Force  began  a  program 
FCS)  Program  [13].  The  objectives  of  the  program  were 
survivabiUty  and  Ufe  cycle  cosP^  of  aircraft  [13,  pg  504].  There  were 
achieving  the  goals  of  the  program.  The  first 


called  the  Self- Repairing  FUght  Control  System  (SR- 
to  “improve  the  reUabiUty,  maintainabiUty, 
two  main  thrusts  involved  in 

method  was  the  development  of  reconfigurable  flight 
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control  systems  (FCS),  meaning  control  systems  that  conid  be  modified  to  compensate  for  change 
conditions.  The  changing  conaions  cohdsted  ifi  a  failute  of  the  airflafie’s  control  surfaces  or 

sensors,  or  in  damage  to  the 

ity.  The  second  component  of  the  program  was 
Improvements  would  come  as 


aircraft.  Reconfigurable  FCS  would  improve  reUability  and  survivabil- 
the  use  of  on-Une  diagnostics  to  identify  failures. 

reductions  in  maintenance  and  repair  costs. 


In  developmg  the  reconfigurable  FCS,  the  Air  Force  decided  on  an  approach  based  on  .den- 
tilying  which  component  had  failed,  then  modifying  the  FCS  based  on  the  identified  problem.  The 
faiinrc  was  identified  through  a  Failure  Detection  and  Isolation  (FDI)  procedure.  A  local  FDI 
algorithm  was  used  to  detect  actuator  failures,  while  a  global  FDI  algorithm  determiued  surface 
damage.  The  global  FDI  used  a  model  of  the  aircraft  to  compare  the  measured  output  with  the 
capected  output.  The  error  between  the  two  tvas  passed  through  several  filters.  The  output  of 
each  filter  represented  a  Ukelihood  that  the  failure  represented  by  the  filter  had  occurred.  These 
likelihoods  were  used  by  a  Pseudo-Surface  Resolver  (PSR)  to  determine  how  control  of  the  aircraft 
could  be  maintained.  The  PSR  used  a  “modified  pseudo-inverse  that  minimizes  changes  in  control 
defiections  after  failure  to  maintain  forces  and  nronren.s."  [13.  pg  508]  The  maintenance  diagnostics 

worked  in.  the  ssime  mmmer. 


There  were  several  Umita.ions  to  this  approach.  One  of  the  more  severe  limitations  was 
that  any  modeling  error  could  be  interpreted  as  a  failure.  Another  limitation  was  that  this  method 
could  only  correctly  identify  errors  that  feU  into  a  select  few  categories.  However,  the  approach 
proved  successful  in  the  flight  tests  and  was  capable  of  handling  various  types  of  failures  of  the  right 
stabilator.  Drnes.  Yeager  fc  Stewart  (29)  concluded  that  “the  test  results  of  the  Self-Repairing  Fbgh. 
Control  System  installed  on  an  F-15  aircraft  indicate  high  potential  for  the  concepts  evaluated,” 
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Another  major  study  of  reconflgurable  flight  control  systems  was  sponsored  by  NAS  A  Lang¬ 
ley  and  carried  out  by  Alphateeh  (c/.  [31]).  The  study  considered  the  appHcation  of  reconfiguration 
strategies  to  stable,  commercial  aircraft  (Boeing  737).  A  single  flight  condition  was  assumed  but 
a  large  variety  of  possible  failures  were  simulated.  The  approach  was  the  precursor  of  the  SRFCS 
approach  described  above,  consisting  of  two  main  components:  a  failure  detection  «r  ident.fica- 
module,  and  a  control  reconfiguration  module.  The  control  design  in  the  NASA/Alphatech 


tion 


study  was  not  based  on  the  approrimation  of  the  unimpa.red  control  actions,  but  on  a  redes.gn  of 
the  control  law  using  linear  quadratic  (LQ)  regulator  theory.  This  approach  is  described  .n  [18] 
and  consists  in  specify.ng  weighting  matrices  ,n  an  LQ  problem  so  that  the  resulting  ciosed-loop 
system  satisfies  some  bandwidth  constraints.  Only  changes  in  the  B  matr.x  of  the  state-space 
representation  were  considered,  but  the  stud,  included  the  possibility  of  incorporating  knowledge 

of  uncertainties  in  the  estimated  B  matrix  in  the  design. 

The  Alphateeh  project  also  studied  extensively  the  problem  created  by  changes  in  trim 
conditions,  which  act  as  constant  disturbances  to  be  added  to  the  state-space  model.  An  automatic 
trim  algorithm  was  developed,  based  on  an  optimisation  procedure,  and  is  reported  in  132).  Of 
related  interest  is  the  work  of  Ostrotf  pd],  which  also  considers  the  automatic  coutrol  redesign  for 
a  Boeing  737  aircraft,  but  suggests  the  incorporation  of  integral  action  in  the  control  law  to  solve 
the  trim  problem  (essentially  considering  it  as  a  disturbance  rejection  problem).  Simnla.ions  tor  a 

mildly  unstable  aircraft  model  are  also  reported  in  this  work. 

Several  other  researchers  have  worked  on  design  methods  based  on  linear  quadratic  (LQ) 
techniques,  assuming  that  the  detection  problem  was  solved  independently.  Huang  &  Stengel  (131 
presented  an  automatic  redesign  method  based  on  iinpbei,  model  fo, .owing,  incorporating  integral 
action.  Moerdet  e«  uL  PU  studied  the  appbcation  of  LQ  eontroBers.  bu.  assumed  that  control 
gains  were  to  be  scheduled  according  to  the  decision  of  the  failure  detection  and  ideutificatiou  logic 
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(as  opposed  to  being  calculated  in  real-time). 

Of  related  Interest  is  the  work  of  Maybeck  k  Stevciis  [19],  which  suggests  a  somewhat 

different  approach.  While  assnmirrg  that  the  possible  failures  have  been  categorized,  the  method 
rehes  on  a  bank  of  Kalman  Wters  to  estimate  the  states  of  the  system  based  on  the  different 
assumptions.  Residual  errors  are  used  to  calculate  the  probabilities  of  individual  failures  and  the 
control  input  is  the  weighted  average  of  the  signals  calculated  under  these  respect, ve  assumpt.ons. 
This  is  significantly  different  from  the  SRFCS  approach  where  the  control  input  corresponding  to 

the  most  likely  failure  is  chosen. 

In  pol,  a  multiple  ntodel  adapt, ve  estimation  approaeh  is  used.  Failures  are  represented  as 
a  veetor  of  unknown  systen,  parameters,  related  to  effectiveness  of  various  actuators  and  sensors. 
Fault  detection  is  achieved  through  the  use  of  filters,  where  each  filter  is  essenttally  a  ntodel  of  fhe 
VISTA  F-16  in  winch  a  certain  actuator  or  sensor  has  failed.  How  well  the  filter  response  and  the 
airplane  response  match  gives  an  estimate  of  how  Ukely  it  is  that  the  particular  failure  has  occurred. 
If  a  partial  failure  has  occurred,  then  the  probability  that  the  failure  has  occurred  represents  the 
loss  of  effectiveness  of  the  sensor  or  control  surface.  Each  failure  has  a  set  of  control  law  gains 
associated  with  it.  The  controller  combines  the  gams  based  on  the  probabiUties  generated.  One  of 
the  difficnlties  of  this  approach  is  that  there  must  be  a  non-zero  input  to  the  system.  If  not.  false 
failures  are  sometimes  detected,  or  real  failures  are  not  detected.  Normally,  the  commands  neces¬ 
sary  to  perform  a  maneuver  are  sufficient  to  excite  the  system.  During  steady  level  flight,  however, 
there  is  not  sufficient  excitation.  The  solution  to  this  problem  was  to  add  a  small  sinusoidal  input 
to  the  system  to  ensure  sufficient  excitation. 
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1.1.2  Nonlinear  and  Adaptive  Control 


to  the  .etWt  described  above,  se.etel  teseatcbets  have  searched  fot  methods  that  do 
„„t  depead  oa  the  tdea.ihcarioa  ot  the  t..ates  betoce  tahia,  acrioa.  A  variety  of  dhecf.oas  have 

been  pursued. 

A  logical  first  step  coasists  la  lookiag  for  a  robust  linear  coatrol  law  that  would  be  sat- 
■,3factor,  for  all  possible  la, paired  aircrafts,  and  would  aclaeve  the  required  perforaraace  for  the 
ualarpalred  aircraft.  Schaelder.  llorowlta  Houpls  ,261  coas.dered  the  use  of  guaat.tat.ve  leed- 
back  theory  for  that  purpose.  However,  Chandler  |Cl  illustrated  with  several  eaaarplcs  that  rt  rs 
geaerallv  not  possible  to  design  a  robust  linear  control  law  that  guarantees  stability  for  the 
paired  conditions  while  providarg  sat.sfaCor,  perfornrance  (or  the  nonra.a,  unfa, led  cond.t.ons,  e 

the  reconfigurable  control  law  to  take  action,  but  inrplied  that  sonre  forn,  of  recoafigurat.on.  for 

example  nonlinear  or  adaptive  control,  would  be  necessary. 

Olttmar  ,U,  Invest, gated  the  use  of  an  adaptive  coatrol  approach  based  oa  hyperstablll.y 
and  the  algorl.hnrs  developed  by  Landau  [IT,.  A  slarularioa  study  concluded  that  the  performance 

accommodating  a  larger  number  of  failure  modes. 

Morse  Ossmaa  [22,  also  considered  a„  adaptive  control  approach  for  the  AFTI/F-16,  us.ag 
algorithms  of  Sobel  t  Kaufmaan  (271,  While  they  pointed  out  some  problems  with  the  speclic 
theory  that  they  were  using,  the  authors  developed  their  own  design  method  for  the  selectron  o 
the  parameters  of  the  algorithm  and  showed  that  the  method  was  successful  even  in  the  presence 

of  multiple  failures. 

Gross  &  Migyaako  114,  considered  the  use  of  “supercoatroller”  technology,  winch  ,s  a  form 
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of  nontoar  control  baaed  oa  polynomial  networks.  Coefficients  of  the  polynonaials  were  adjusted 

using  an  optimisation  program  and  a  data  base  of  optimal  responses.  This  method  was  further 
developed  in  [1]  and  connected  to  recent  work  in  neural  networks.  Sofge  fe  White  ps]  also  mention 
efforts  currently  under  way  at  McDonneE  Douglas  in  the  neural  network  area.  It  is  interesting  to 
note  that  while  the  implementation  of  the  controUers  does  not  rely  on  expUdt  failure  recognition, 
the  training  of  the  networks  does,  so  that  this  method  can  be  considered  a  hybrid  between  the  two 
approaches  d.scussed  in  tins  brief  overview  of  the  literature  on  reconfrgnrabie  li.ght  control  systems. 

Research  on  the  application  of  adaptive  methods  to  reconfign table  eo.ttrol  has  also  been 
carried  out  at  Wright-Patterson  Air  Force  Base  (71.  |8|.  [9|,  (lO].  The  emphasis  of  the  research  has 
been  on  the  use  of  constrai.ted  least-s<,uares  identilication  methods  and  model  predictive  control. 
The  results  have  been  restricted  to  single-, nput  siugle-output  pitch  axis  models,  but  they  have 
successfully  included  actuator  rate  saturation  ,n  the  design,  as  well  as  prior  .nformation  on  the 

Stability  derivates. 


1.2  Overview 

This  thesis  discusses  some  of  the  issues  and  problems  w.th  implementing  multivariable  adaptive 
control  for  reconftgurable  flight  control.  Although  we  focus  on  model  reference  control,  several 
formulations  of  the  problem  are  discussed,  as  weU  as  several  adaptation  algorithms.  We  discuss 
the  hrtear  parametrisation  of  the  aircraft  dynamics,  for  both  the  direct  and  indirect  approaches. 
The  algorithms  are  implemented  together  with  a  complete  nonlinear  simulat.on  of  a  twin-engine 
jet  aircraft,  and  the  performance  of  various  approaches  are  evaluated  and  compared.  The  thes.s 
expands  on  previous  results  presented  in  [4]. 
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2  Adaptive  Control  Algorithms 


2.1  Aircraft  Model 

The  kinematic  behavior  of  an  airplane  is  governed  by  a  set  of  nonlinear  differential  equations.  For 
Bight  control  system  design,  these  equations  can  be  approximated  effectively  by  a  set  of  linear 

differential  equations. 


/\x  d"  ^ 


y  =  Cx 


(1) 


^  •  1  (n  firrouut  foi'  the  trim  values  of  the  input  necessary  to  maintain 

A  disturbance  term  d  is  included  to  account  loi  me 

Bight  at  the  operat.ng  point.  By  explicitly  including  this  disturbance,  the  teconflgutable  flight 
control  system  can  automatically  set  the  trim,  freeing  the  pilot  from  having  to.  Because  the  trnn 
values  may  change  radicaUy  after  a  failure,  such  a  feature  is  quite  important. 

The  states  of  the  aircraft  are  represented  by  x.  The  longitudinal  states  are  a  (angle  of 
attack),  ,  (pitch  rate),  h  (altitude),  and  .  (velocity).  The  lateral  states  are  ^  (sideslip),  P  (roU 
rate),  r  (yaw  rate),  4-  (roU  angle),  and  4  (yaw  angle).  For  the  design  of  stabiUty  augmentation  lirght 
control  systems,  the  state  vector  can  be  reduced  to  only  live  states;  a.  9.  /3.  P,  r-  The  control  inputs 
„  are  also  divided  Into  lateral  and  longitudinal  inputs.  The  longitudinal  inputs  are  ds  (elevator 
command)  and  dp  (thrust  command).  The  lateral  inputs  are  dn  (aileron  command)  and  dn  (rudder 
command).  In  the  aircraft  under  consideration  in  the  simulations,  the  elevator  command  dg  is 
interpreted  as  a  symmetric  command  to  the  elevators,  denoted  d„,  and  a  dilferential  command  to 
rhe  ailerons,  denoted  dp.  For  stahility  augmentation  flight  control  system  design,  dx  is  nsnaUy  not 

considered. 

There  are  several  choices  available  for  the  control  output  p.  The  states  q,  p,  and  r  are 
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good  choices  for  low  dynamic  pressure  and  Umited  angle  of  attack  (c/,  [15]).  The  problem  ts  then 
the  control  of  a  three-input,  three-output,  linear  time-invariant  system.  Because  of  symmetry  in 

the  unfailed  aircraft,  the  lateral  and  longitudinal  axes  are  usually  decoupled.  However,  after  a 
failure  the  airplane  win  nsuahy  no  longer  be  symmetric.  Therefore,  the  lateral  and  longitudinal 
axes  cannot  be  decoupled.  The  trim  conditions  ma,  also  change  much  faster  at  the  time  of  a  failure 
than  they  would  m  the  nnfailed  aircraft,  especially  if  a  control  surface  becomes  floating  or  missing. 

2.2  Adaptive  Control  Algorithms 

Several  adaptive  algorithms  can  be  nsed  in  reconfigurable  Bight  control.  The  control  objective 
considered  here  is  based  on  model  reference  control.  .-V  motivation  is  that  this  object.ve  allows  ns  to 
easily  incorporate  considerations  of  tracking  and  deconpling  into  design.  Fnrther,  relatively  simple 
algorithms  are  obtained.  Several  adaptation  mechanisms  have  been  proposed  in  the  literature  (see 
(251).  Those  presented  here  are  modified  shghtly  to  include  the  constant  disturbance  d,  and  to 
exploit  the  fact  that  all  the  states  and  their  derivatives  are  measured.  State  variable  filters  are  not 
needed  to  reconstruct  the  state  and  a  direct  state  feedback  control  law  can  be  used. 


Assumptions  and  Reference  Model 

We  consider  the  state-space  model  for  the  plant  (1),  where  x  £  fl”.  «  £  B”.  9  £  fl".  “d  £  -«”■ 
We  assume  that  the  whole  state  x  is  available  for  measurement,  although  only  the  output  y  is  to 
be  tracked.  The  objective  is  for  ,  to  match  the  output  w  of  a  reference  model 

=  A^f  VM  4-  Bm  ^ 


(2) 


where  w  £  U”  and  r  £  fl”.  The  matrices  and  are  arbitrary  square  matrices,  with  A„ 
stable.  For  the  model  reference  control  problem  to  have  a  relatively  simple  solution,  we  assume: 
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Assumption  1:  The  plant  has  relative  degree  1,  he.  detiCB)  7^  0. 

Assumption  2:  The  plant  transfer  function  is  minimum  phase,  i.e.  the  zeros  of  transmission 
the  system  are  in  the  open  left-half-plane. 

The  Brst  =u.sumptioa  guarantees  that  the  dosedloop  transfer  function  of  the  plant  can  be 
nrade  to  match  the  transfer  function  of  the  reference  model  (2)  using  a  proper  compensator.  If 
the  assumption  is  not  satisfied,  the  model  reference  control  problem  may  still  be  solvable,  but 
a  more  complex  reference  model  would  have  to  be  chosen,  so  as  to  match  the  so  ca 
normal  form  of  the  plant  (c/.  [25]).  When  the  first  assumption  is  satisfied,  this  Hermrte  form 
is  simply  //(s)  =  diag{l/s)  cvhich  means  tha,  the  behavior  of  the  plant  at  infinity  is  that  of  a 
multivariable  integrator.  The  matnx  CB  is  called  the  lmjk-fro,nency  ,ja.u  mutex  of  the  plant  and 
,s  usually  denoted  Kr  in  the  adaptive  control  literatrrre.  It  is  a  most  critical  parameter  for  adaptive 

algorithms. 

The  second  assumption  is  a  necessary  assumption  to  guarantee  the  internal  stability  of 
the  mode!  reference  control  algorithm.  The  dimension  of  the  state-space  mode!  is  m.  while  the 
dimension  of  the  state-space  model  for  the  plant  is  «.  Therefore  n  -  m  modes  must  be  made 
unobservable  or  uncontrollable.  It  can  be  shown  that  the  model  reference  control  law  places  m 
modes  of  the  plant  at  the  desired  model  reference  locations,  and  makes  the  others  unobservable  by 
placing  them  at  the  locations  of  the  transmission  zeros. 


Model  Reference  Control  Law 

We  consider  the  state  feedback  control  law 

u  =  CoT  +  Gqx  -b  V 

where  Co  €  €  R^  are  free  controller  parameters.  The  control  law  is  repre 


(3) 
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Figure  1;  Model  llcference  CoiUrol  Loop 
seated  iu  Figure  1.  The  closed-loop  dynamics  are  given  by 

i  =  Ax  +  DCor  +  BGox  +  Bv  +  d 

y  =  Cx 

or,  in  terms  of  the  output  y, 

y  =  {CA  +  CBGq)x  +  CBCor  +  CBv  +  Cd 

(5)  leads  to  the  same  input/outpet  relatioaship  as  that  ol  the  refereace  model  (2)  for  the  so 
nominal  values  of  the  controller  parameters 

C*  =  {CB)-^Bm 

G*  =  {CB)-\AmC  -CA) 

V*  =  -{CB)~\Cd) 


(4) 


(5) 

called 


(6) 
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If  the  plant  waa  known,  these  would  he  the  controUet  patanteters  that  one  would  use  to  achieve 
the  model  reference  control  objective. 

Indicect  Adaptive  Control  Indirect  adaptive  control  involves  two  stages:  First,  estinrates  of 
the  plan,  parameters  A.  B,  and  d  are  generated.  Once  the  plan,  parameters  have  been  estimated. 

used  to  generate  controller  parameters.  If  A.  B,  d  are  the  estimates  of  A.  B,  d. 


the  estimates  are 
an  error  vector  can  be  defined  by 


Cl 


Ax  +  Bu  +  d  —  X 


(7) 


Given  the  assumptions,  the  error  vector  can  also  be  expressed  as 

ei  =  (A  -  A)x  +  (/)  -  B)u  +  (d  -  d) 


(8) 


A  least-squares  algorithm  can  be  used  to  find  the 


estimates  of  A,  5,  and  d  which  minimize  the 


sum  of  the  norm  of  cr  evaluated  at  several  samplirrg  instants.  Each  row  of  (7)  can  he  treated 
independently  in  this  process.  The  algorithm  requires  the  measurement  of  x.  The  derivative  can 


often  be  obtained  by  filtered  differentiation 


of  X.  For  flight  control  systems,  differentiation  is  not 


usuaUy  needed,  as  the  derivatives  can  be  measured  directly. 

Once  the  estimates  are  obtained,  the  controller  parameters  Co,  Go,  and  u  in  (3)  are  obtained 

from 


Co  =  {CB)-^Bm 
Go  =  {CBr\AMC-CA) 
V  =  -{CBr\Cd) 


(9) 


This  is  the  same  relationship  as  (6),  replacing  the  plant  parameters  with  their  estimates.  In  addition 
to  the  issue  of  stabiht,.  a  major  guest.on  remains  to  he  resolved:  what  to  do  when  CB  is  singular. 
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There  are  few  methods  which  satisfactorily  address  this  problem,  from  a  practical  poiiit-of-view, 
bat  it  has  beea  a  sabject  of  recent  research  in  the  adaptive  control  field. 

Direct  Adaptive  Control  Instead  of  estimating  the  plant  parameters,  a  direct  adaptive  control 
algorithm  estimates  the  controher  parameters  Co,  Go,  and  n.  There  are  two  main  approaches: 

output  error  and  input  error. 

Output  Error  The  output  error  e„  .s  defined  as  the  difference  between  the  plant 

ontpnt,  and  the  output  from  the  reference  model,  when  given  the  same  input  as  the  plant.  Because 
the  system  is  assumed  to  have  relative  degree  1,  ll.orc  is  no  need  for  the  so-called  augmented  error, 
and  a  simple  algorithm  results.  The  following  fact  is  used. 

Fact  1:  The  output  error  cq  satisfies 

eo  =  (si  -  AMr\CB)[{Co  -  Co)r  +  {Go  -  Go)x  +  {v  -  n‘)]  (10] 


Proof:  Define 

=  Amxm  +  BCqT  +  BGIxm  +  Bv*  +  d 


(11) 


so  that  VM  =  CXM  for  some  appropriate  choice  of  initial  conditions  for  Since  do  -  Ci  VM 
Ci  -  AmGxm  -  Bmt,  along  with  (6),  we  find 

do  =  {CA  +  {CB)Gl){x  -  xm){CB)[{Co  -  Co)r  +  (Go  -  ^0)2:  +  (t;  -  «*)]  0 


which  leads  to  (10). 

Equation  (10)  can  be  expressed  compactly  in  the  form 

eo  =  {si  -  Am) 


(13) 


where 
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$  =  ((Co  -  Co)  (<^0  -  Go)  -  ^*)) 


(14) 


,  the  »x(,n  +  »  +  1)  matrix  of  paraxneter  ergots,  and  the  tepeeaeor  eectoe  D  has 

dimension  m  +  n  +  1.  The  prodnct  .  • »  is  carried  out  iu  the  time  dontai..,  and  the  result, ug  s.gual 

is  applied  to  the  Unear  time-invariant  operator  (s/  -  Am)  (C^)- 

Fact  2:  The  update  law  i  =  -Gc„»T  leads  to  a„  adaptive  system  that  is  stable  in  the  sense 
or  Lyapunov,  with  the  property  that  e„  tends  to  aero  as  ,  ^  oo,  provided  that  (sf  -  A.,)-  ,s  a 
strictly  positive  real  transfer  functiou  matr.x,  aud  (CBfG^’  is  a  positive  delinite  matnx. 

Pwo!:  follows  from  Kalman- Yaculiovich-Popov  lemma  (e.g.  |23]). 

This  algorithm  is  sm.ilar  to  other  algorithms,  such  as  the  one  available  in  [23],  with  some 
adjustments  needed  because  ot  the  constant  disturbance  d  aud  the  state  measurement.  The  strictly 
positive  real  condition  can  be  sat, shed  by  choosing  A„  so  that  Am  +  AJ,  is  negative  dehmte.  The 
other  condition  requires  prior  knowledge  of  the  high-freguenc,  gain  matrix  CB,  and  an  approprrate 

choice  for  the  adaptive  gain  matrix  G. 

Input  Error  Another  formulation  of  the  direct  adaptive  algorithm  uses  the  foUowing  fact. 
Fact  3:  The  following  identity  is  satisfied  for  all  time. 


Proof:  From  (1),  we  find 


y  _  Amv  =  CAx  +  CBu  +  Cd-  AmCx 


(16) 


Using  the  nominal  parameter  values  in  (6),  we  get 

y^AMy  =  (CB)(-GoX  +  u-v*) 


(17) 
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from  which  (15)  follows. 


A  new  error  equation  can  be  defined  from  (15). 

62  =  CoBj}{y  -  Amv)  AGqx^v-u 


(18) 


Given  the  assumptions,  (18)  can  be  expressed  as 

62  =  d>  ■  Z 


(19) 


4.  is  the  co.Uroller  error  dcfororl  i,.  (M),  and  r  is  a  „ca.  regressor  vector  dermed 

/  \ 


as 


^  BjJiii  -  Amp) 


V  '  J 

The  rrrairr  differerree  between  the  error  equat.ons  (19)  and  (13)  is  the  absence  of  the  transfer 
functron  between  the  pararrreter  error  and  the  error  signal.  This  eliminates  the  str.ctly  positive  real 
condition  necessary  for  the  stability  of  the  algorithm,  inclnding  the  condition  on  the  high-frequency 
gain  matrix.  Also,  it  makes  possible  the  use  of  least-squares  algorithms. 


The  three  algorithms  presented  above  aU  achieve  the  same  model  reference  control  objective, 
but  with  different  structures  and  different  assumptions.  The  Mowing  issues  should  be  considered: 

Number  of  Parameters:  Both  direct  methods  estimate  the  controUer  parameters  Co, 
a,  and  n.  These  have  a  total  of  -b  mu  -h  "•  elements.  The  indirect  algorithm  estimates  A,  B 
and  d,  which  have  a  total  of  -b  mn  +  n  elements.  Since  m  <  n  in  this  application  and  in  generai, 
the  direct  algorithms  estimate  fewer  parameters.  For  the  reduced-order  aircraft  model,  n  =  5  and 
m  =  3.  The  indirect  algorithm  must  estimate  45  parameters,  while  the  direct  algorithms  only 

estimate  27. 
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Prior  Information:  The  direct  output  error  algorithrr.  requires  the  most  restrictive  as¬ 
sumptions,  by  imposing  a  positive  deliniteness  condition  on  the  product  of  the  transpose  of  the 
high-frequency  gain  matrix  with  the  inverse  of  the  adaptation  gain  matrix.  There  is  no  obvious 
way  to  enforce  this  condition.  For  aU  practice  purposes,  this  condition  is  a  symmetry  and  positive 
deiimteness  condition  on  the  high-frequency  gmn  matrix  itseh,  and  is  not  easily  guaranteed.  The 
direct  input  error  artd  indirect  algorithms  place  less  stringent  conditions  on  the  high-frequency  gam 
matrix  The  indirect  algorithm  requires  that  the  estimate  of  the  high-frequency  gain  matrix  be 
uonsingular  at  all  times.  It  cau  be  shown  that  the  direct  ...put  error  algorithm  requires  that  the 
parameter  Co  must  be  nonsingular  at  all  times.  In  [12),  it  was  show.,  that  the  stability  of  the  overall 
adaptive  syste.n  could  be  guaranteed  by  incorporatiug  a  modification  based  on  a  sort  of  hysteresis 
into  the  algorithm  with  a  least-squares  update,  and  requiring  an  upper  bound  on  the  ..or...  of  the 

high-frequency  gain  matrix. 

Adaptation  Algorithms.  The  i..direct  and  direct  input  error  approaches  can  be  used  with 
least-squares  algorithms,  in  add.tion  to  gradient  descent  ^gorithms.  The  least-squares  algorithms, 
which  are  faster  and  more  efficient,  can  be  used  in  their  batch  or  recursive  forms.  The  batch  forms 

iend  themselves  to  monitoring  of  the  estimation  quality  [3]. 

Flexibility:  The  indirect  algorithm  is  the  most  flexible  of  the  three  approaches.  Other 
control  strategies  than  model  reference  can  be  used,  such  as  model  predictive  control  (fij.  The 
direct  algor.thms  are  not  easily  modified  away  from  the  reference  model  formulation.  The  indirect 
algorithm  also  has  the  Hexibility  to  incorporate  prior  knowledge  on  the  plant  parameters. 


Given  these  considerations,  it  was  decided  that  the  direct  input  error  algorithm  was  best 
suited  to  the  problem  of  reconfigurable  flight  control.  Given  the  large  number  of  computations 
required,  it  was  felt  that  fewer  parameters  was  a  siguificant  advantage.  Also,  the  least-squares 
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algorithms  are  known  to  converge  much  faster  than  the  gradient  descent  algont 
anmher  of  parameters  is  large.  This  is  especially  important  considering  that  quick  reconSguration 
aiay  be  critlcaf.  A  positive  definiteness  condition  on  the  high-fregnency  gain  matrU  could  also 
nardly  be  justified  in  this  appfication.  Therefore,  the  output  error  algorithm  was  not  further 

1  ■  d  nf  tlik  thesis  it  is  impUed  that  the  direct  input  error  formulation  of 

considered.  For  the  remainder  of  this  thesis,  it  is  imp 

adaptive  control  is  being  used. 


Adaptation  Algorithms 

Given  a  set  of  iinear  equations  with  unknown  coemcien.s,  the  leasTsgnares  algorithm  constitutes 
a  fast  and  efficient  way  to  find  the  set  of  coefficients  which  most  closely  match  the  equations. 
The  direct  input  error  algorithm  is  easily  foimulated  so  that  least-squares  estimation  can  be  used. 
Define  the  regressor  vector  a  as  in  (20),  and  the  parameter  matrix  0  as 

(  —'tn  \ 


e 


cl 
cl 

\  ) 


(21) 


where  Co,  Co,  and  u  are  the  control  law  parameters  in  (3).  The  input  error  e.  can  then  be  expressed 


as 


62  =  U 


e^z 


(22) 


where  u  is  the  system  input. 

Least-Squares  Algorithm:  Wc  treat  each  component  of  e,  independently.  The  problem 
then  becomes  that  of  finding  a  vector  «,  such  that  the  sum  of  over  A  sampling  instants  ,s 

minimized,  where  d,  is  the  column  of  d,  and  cz,  is  the  element  of  The  least-squares 


solution  is 
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The  solutions  for  the  different  columns  of  d  can  be  combined  in  a  single  expression. 

derived  from  tire  batch  solution.  If  we  define 


The  recursive  least-square  (RLS)  algorithm  can  be 

the  covariance  matrix 


(25) 


^fc=i 


the  batch  least-squares  for  n  sampling  instants  is  ^ 

fl[7i]  =  P[n\  ( Y1 


(26) 


tit=i 


P\n\z\n  +  l]z^\n  +  1]-PW 
P[n  -h  1]  =  PbA  "  ^  ^  +  l]P[n]z[n  -h  1] 


e[n  -h  1]  =  0[n] 


P[n]z[n  -f  lUz'^ln  +  l]g[n]  - 

± z — — rrr -  -  ^  T  r  .  1  T 


From  this,  we  derive 

(27) 

,  _  (28) 

l]P\n\^  +  if 

•.n  T?  ftine  Factor*  The  least-squares  suffers  from  what  is  caUed 
Least-Squares  with  Forgetting  t actor. 

1  yri-wTffcl  can  grow  without  bound,  the 

•  nrnhlem  1251  Because  the  sum  Efcr=i  ^ 

the  covariance  wind-up  proDiem  ^ 

matrix  C  can  become  arbitrarily  smah.  Adaptation  will  become  very  slow,  which  is  unaccepta  e 
in  the  adaptive  problem  under  consideration. 

One  solution  to  the  covar.ance  w.nd-up  problem  is  to  replace  the  least-squares  algorithm 
„ith  the  least-squares  with  forgettmg  factor  algorithm.  The  leaa.-squares  algorithm  ts  mod.hed 
by  weighting  the  data  exponentiaUy,  with  older  data  having  iess  weight  than  current  data.  re 

resulting  algorithm  is  well  known. 


e[n]  =  P[n]  '^j 

if  P[n]z[n  +  l]z'^[n  +  l]P[n] 

P[n  +  1]  =  1 -  A  +  zT[n  +  l]P[n]z[n  +  1] 


P\n]z[n  +  l]{u^[n  + i]  -  z'^[n  +  1]^M)  (32) 

e[n  4-  1]  =  +  x+  z'^[n  +  l]P[n]z[n  +  l] 

Equations  (29)  and  (30)  rupruso.d  tl«  l.a.cl,  fo,„,  of  fl.c  leas.-squaros  svi.h  forgetting  factor  algo¬ 
rithm,  while  (31)  and  (32)  are  the  recursive  fornr.  The  parameter  A  must  be  positive  and  less  than 
E  Typically,  the  range  is  restricted  to  0.95  to  0.99  lArr  0.99,  the  equivalent  time  constant  of  the 

exponential  weighting  is  equal  to  100  samples. 

stabilized  RLS  with  Forgetting  Factor:  The  least-squares  with  forgetting  factor  algo¬ 
rithm  unfortunately  exhibits  a  new  problem.  The  algorithm  becomes  unstable  if  there  is  insufficient 
excitation.  This  is  often  the  case  with  aircraft,  as  stead,  level  flight  does  .rot  prov.de  snflrcierrt 
excitation  for  convergence  of  the  parameters.  The  regressor  vector  a  must  be  persistently  ereiUny 
to  guarantee  convergence  of  the  parameters  to  the  nominal  values  [25].  One  possible  solution  is  to 
add  a  smaU  perturbation  to  the  controls,  such  as  white  noise.  However,  it  will  affect  the  flight  of 
the  aircraft  (and  may  be  a  poor  choree  for  a  combat  aircraft  which  is  tracking  a  target,  or  while 
landing).  Another  solution  is  to  use  a  stabilized  version  of  the  least-squares  with  forgetting  factor. 
A  new  stabihzed  algorithm  was  recently  proposed  in  121  (based  on  a  concept  proposed  in  (30])  and 

is  discussed  now. 

The  stabilized  least-squares  algorithm  is  obtained  by  including  an  additional  term  m  the 
error  function  used  by  the  least-squares. 
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The  additional  term  penaUzes  changes  in  the  parameter  matrix,  6.  Setting  0  yields 

z[k]u'^[k]X^'^  +  ae[n  -  1]^  (34) 

but  it  is  not  truly  a  batch  solution, 


This  is  the  equivalent  of  the  batch  solution  of  the  lea^t-squares, 

because  of  the  dependence  on  e[n  -  1]-  The  matrix 

^  z[k]z'^[k]y^  +  a/ j 

Vfc=i  ^ 

defined  as  the  covariance  matrix  of  this  algorithm.  A  recursive 


-1 


(35) 

form  for  the  inverse  of  the 


IS 


covariance  is 


p-^[n]  =  XP-'[n  -  1]  +  z[n]z'^[n]  +  a(l  -  A)i 
with  the  initial  condition  =  «■(.  TI,o  recursive  formula  for  0  is 


(36) 


_  1,  +  Pi,.|s[.ri(.qn)  -  - 11) + -  ‘1  - 

One  problen,  is  to  transform  (36)  into  a  recursion  for  Fin],  The  recursive  leas.-s,uares  and 
least-spnares  with  forgetting  factor  algor.tl.ms  make  use  of  the  matrix  .aversion  lemma 

(A  +  BC)-‘  =  A-'B(/  +  CAB)-'CA-'  P8) 

in  the  recursive  update  of  F[nl.  The  inversion  of  the  matrix  (/  +  CAB)  is  simplified  because  the 
product  xnnlF[n  -  Hxln)  is  a  scalar.  For  the  update  law  (36).  the  matrix  inversion  lemma  can 

Still  be  used,  but  with  B  =  defined  as 

B  =  (aln)  t/MTA)d  (“> 


However  the  product  P{n  -  US  is  not  a  scalar,  but  a  matrix  whose  dimension  is  one  greater 
than  PW.  It  would  be  easier  to  update  p-'N  and  invert  it  than  to  update  P). 


n  . 
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An  alternate  solntion  is  to  replace  o(l  -  A)/  i"  ‘he  update  law  by  mo(l  -  X)e.ef,  where 

e-  is  a  vector  of  aeros,  except  the  position  which  is  one,  and  m  IS  the  dimension  of  Z.  As  time 
progresses. .  is  incremented  and  returned  to  one  when  the  end  of  the  vector  is  reached.  The  nratrix 

B  becomes 

(40) 


5  = 

With  this  modihcation.  the  matrix  -  HB  is  only  -2x2,  which  is  easily  inverted.  Averaging 

analysis  (21  shows  that  the  averaged  system  responses  are  identical  for  both  implementations, 

•  •  dch  htdtrb  recursive  or  a  periodic  batch  implementation.  This  algo- 

A  third  option  is  a  pseudo- batch  recursive,  oi  i 

,  •  i-  If  ..rad-ttpr;  0  oiilv  Gvcrv  N  samplcs.  In  this  way, 
rithm  updates  the  P''  matrix  each  iteration,  but  npdale,  »  only  every 

p-i  is  only  inverted  every  N  samples.  The  error  criterion  for  this  implementation  ,s 

E(0ln])  =  -  wZ-ItldlnDU""-'-  +  «l«lnl  '  -  ‘It 

fc=l 

where  is  the  parameter  estimate  based  on  nN  data  points.  Solving  f  =  0,  we  find 

91-1  =  fZ  cltlabfclA”«-‘  +  of)  (E  altl»^  WA"""'’  +  o»l»  "  'l) 

Let  R[n]  =  P-^[n\e[n].  Then  P-^[n]  and  il[n]  are  updated  according  to 


(41) 


(42) 


p-ir^i  _  p-'^[n  -  l]  + a{l  -  X^)I +  ^+al 

L  J  j=l 

N 


(43) 


R[n]  =  x^P-^[n-l]e[n-l]  +  oc0[n-l]-aX^'0[n-2]  +  J2^z[n\u^[n]X^  ^  (44) 


where  j  =  k-{n-l)N.  Every  N  samples,  6  is  updated  by 

0  =  P[n]R[n] 


(45) 


where 


P[n]  is  computed  by  inverting  P  M”]- 
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The  stabilized  recursive  least-s^uares  with  forgetting  factor  has  the  property  that  the  eo- 

v^iance  matrr.  and  its  inverse  are  bounded  P,.  The  only  condrtiou  is  that  is 

,  -ru  -a  urn  was  used  in  the  thesis,  in  part  because  it  only  required  the 
approximate  algorithms  using  (40)  was  usea  in 

inversion  of  a  matrix  of  size  2x2. 


2.3  Reference  Model  and  Auto  Pilot 

reference  control  law  is  for  the  airplane  with  feedback  to  have  dynamics 


The  objective  of  the  model 


chosen  model.  The  reference 


model  must  have  the  same  relative 


which  approximate  those  of  a 

degree  as  the  plant,  but  ,s  otherwise  arbitrary.  The  reference 

input  vector  is  u'’  =  (^/f  ^r)  itnd  the 


model  chosen  for  the  reconfiguration 


^  _a_/3,  with  a  =  2.5  (c/.  [15]).  The  i. 


application  is  /R-s)  — 
output  vector  =  (g  P^')- 

,f  we  assume  that  the  reference  model  is  matched,  an  auto  pilot  can  also  be  designed  around 
that  model.  One  choice  for  an  auto  pilot  Is  one  that  tracks  the  angles  d,  and  rp.  Assummg  that 
the  reference  model  is  matched,  we  have 


q  =  —2.5^  T  2.5gc 

■p  — 2.5p  T  2.bpc 


(46) 


r  =  -2.5r  +  2.5rc 


where  qc-,  Pc  r< 


are  the  elements  of  the  reference  input  in  (2).  We  can  use  the  relationships 


0  =  q 
^  =  P 

i/)  =  r 


(47) 
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with  the  commands 


qc  =  #c-^) 

Pc  =  9(^c  - 

rc  =  aii’c  -  i^) 

so  that  the  angles  track  the  desired  angles  with  the  transfer  function 

2.5g 

5^  -f  2.5s  -h  2.5<7 

With  the  constant  g  set  to  1.6,  the  closed-loop  poles  are  at  -1.25  ±  J  1-56 


(48) 


(49) 


3  Implementation 


3.1  AIAA  Model 
The  algorithms  discussed  above  were  implemented  on  a 


FORTRAN  simulation  of  a  high  perfor- 


mane,  twia  eagma  j«  airplane  (S).  The  simulation  includes  nonlinear  aerodynamics  over  tire  entire 
operational  envelope,  as  rvell  as  thrust  and  engine  response  data.  Some  of  the  relevant  parameters 
for  the  simulated  aircraft  are  included  m  Table  1. 


3.2  Batch  LS  Results 

Our  first  implementation  of  model  reference  control  was  to  use  a  batch  least-squares  identification 
off-bne,  with  data  collected  from  the  AIAA  simulation.  Both  the  indirect  and  direct  input  error 
methods  were  used.  Control  performance  was  found  to  be  srmilar  m  both  cases,  and  the  results 
shown  here  are  for  the  direct  input  error  algorithm.  The  iderrtifications  were  performed  at  the  flight 
condition  of  Mach  0.5  at  altitude  9,800  feet.  Identifications  were  also  performed  for  an  aircraft  on 
wluch  the  left  horizontal  tail  surface  had  become  stuck.  Ten  seconds  of  data  were  used  to  perform 
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Parameter 

Symbol 

Value 

- * - 

W 

45,000  lb 

Weight 

c 

o 

CO 

'  o 

Wing  Area 

/> 

42.8  ft 

Wing  Span 

u 

15.95  ft 

Mean  Aerodynamic  Chord 

Moments 

4 

28,700  slug/ft^ 

h 

165,100  slug/ft^ 

h 

187,900  slug/ft^ 

i Til 

0  slug/ft^ 

l.z 

-520  slug/ft^ 

lyz 

0  slug/ft^ 

Table  1:  Parameters  Used  iir  tire  AIAA  Sirnulatiorr 

system  matrices  A,  B,  and  d  were 


used  to  check  the  validity  of 


each  identification .  The  identified  ^  ^ 

the  assumptions,  in  particular  the  invertibiUty  of  the  high-frequency  gam  matrix  and  the  mimm 

pHa.e  .3U.P.O.  If  tKe  plan.  Lad  aof  .eea  .Ini.u.  p.as.  a  a«e.Pt  cloi.  pf  cafpal  couM  avc 
,eep  paea  fo  acUeve  .ipin™.  plaae.  SpecI«caU.  ,  apa  P  cop.a  .ave  rep.acea  1,  ,  .  A.. 


then  used  to  control  the  airplane.  In  turn,  each  of 


and  r  —  Ii0^  [15]- 

The  identified  matrices  Co,Go,^^  were 

TKe  results  for  tlte  afreet  fuput  error  laeutlficuf.ou  are  sKowp  f„  Figure  .  Tire  e.pee.ea  output 
,  tire  SOM  Uues,  a„a  t.e  ueirreeea  output  p  is  sForeu  as  aasirea  iiues.  Tire  ae.rreuea 


output 


closely  matches  the  expected  output. 


of  the  responses  from  tests  with  the 


aircraft  after  failure,  using  the 


Figure  3  shows  some 

uufailea  aircraft.  As  oue  wouM  expect,  perforurauce  degrades,  with  a 


matrices  identified  for  the 
strong  cross 
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Figure  3:  Batch  LS;  Rcspoases  for  aircraft  with  failure,  using  parameters  of  unfailed 
from  the  failed  system  compensate  adequate,,  for  the  failure.  Performance  degradation  is  minor. 


except  for  a 


residual  cross-coupling  from  pitch  to  roll. 


3.3  Recursive 


The  next  implementation 


ive  LS  with  Forgetting  Factor  Results 


recursive  least-squares  with  forgetting 


factor  (RLSFF)  algorithm 


for  direct  input  error 


adaptive  control 


.  Performance  was  tested  in  the  same  manner  as 


it  was  tested 


for  the  off-line 


ine  batch  LS  identification.  In  turn,  each  reference  input 


was  given  a  series  of  step 
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Figure  5:  RLSFF;  Responses  for  aircraft  without  failure 

wMle  the  othet  iape.s  re^ataed  The  ach.eveh  output  is  cotapated  to  the  output  of 

the  refereace  model.  As  caa  be  seea  la  figure  5,  the  system  output  follotvs  the  refereace  model 

hatch  LS  algorithm  (dashed  llaes)  to  those  oh.alaed  Item  the  RLSFF  algorithm  (dash-dot  hues). 
The  performaace  of  the  two  algorithms  .s  comparable,  except  that  cross-couphags  are  s.ga.ficaatly 

reduced  with  the  recursive  least-squares  algorithm. 

The  m.a  purpose  of  usiag  a  recursive  algor, thm  is  so  that  the  eoatrol  galas  ca.  adapt  to 
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Figure  8:  RLSFF,  Adaptutiou  (o  failure:  failure  occurs  at  t  -  310  secouds, 

eirauges  iu  tire  plaut.  As  before,  a  failure  .u  tire  left  borizou.al  tail  surface  was  simulated.  Tire 
RLSFF  algorithm  chauges  the  control  parameters  to  adapt  to  the  failure,  restormg  system  perfor¬ 
mance.  Figure  8  shows  pitch  rate.  roU  rate,  and  yaw  rate  of  the  reference  model  (solid  lines),  and 
for  the  simulated  airplane  (dashed  hues),  when  the  pitch  rate  command  wan  given  step  changes. 
Tire  time  period  from  305  seconds  to  310  seconds  shows  the  response  of  the  aircraft  without  failure, 
using  the  RLSFF  identihcatlon  of  the  control  parameters.  At  310  seconds,  the  left  hormontad  ta.l 
surface  becomes  stuch.  InitiaUy.  much  of  the  model  trachrng  is  lost,  as  seen  from  310  seconds  to 
315  seconds.  But,  as  seen  from  315  seconds  to  320  secorrds.  performance  improves  as  the  control 

parameters  adapt  to  the  changes  in  the  plant. 
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■*?SO 


-  RLSFF;  Low  excitation  results  m  instability 


Figure  9 

3.4  Stabilized  RLSFF  Results 

Tl.  recursive  .e.st-s,uares  witir  forge.Urrg  tacror  aigorUlr.e  beco.ees  unstaUe  when  .Irere  is  insuffi- 

cierr.  eaciratioa.  F.gore  9  shows  the  (I.l)  ““ 

matrix  P.  Duriag  a  period  of  iosafficieot  excitatioa,  such  as  durmg  steady  level  fl.ght,  the  matt, 

become  uhbouhded.  This  iustahiUt,  is  the  motivat.ou  for  the  stahihsed  RLSFF  algonthm. 

its  urrmodihed  for.,  the  stabilised  recursive  ieast-sguares  with  forget.irrg  factor  .dent,- 

flcation  algorithm  requires  the  inverse  of  a  mxm  matrix,  where  m  is  the  numb 

f„  our  case  m  =  9.  The  modihcatiou  proposed  rn  eguatious  (39)  and  (40)  allows  implementat.on 
of  the  algorithm  using  the  Inverse  of  a  3x2  matrix.  This  reduced  algorithm  was  implemented  on 
the  airplane  simulation.  The  test  with  the  per.od  of  guiet  was  performed  agarn,  with  the  stabiUsed 
RLSFF.  Figure  10  shows  that  the  covariance  and  parameter  matrices  remain  stable.  Frgure  11 
shows  the  results  of  a  similar  test.  In  this  experiment,  a  failure  occurs  in  the  middle  of  the  per.od 
of  puiet.  During  the  period  of  quiet,  the  algorithm  does  not  adapt  to  the  failure,  because  there 
is  no  excitation  which  can  be  used  to  rdent.fy  the  changes  in  the  parameters.  But  when  there  .s 

excitation,  the  algorithm  adapts  to  the  farlute. 

One  issue  that  presented  itself  was  that  numerical  errors  caused  the  covariance  matr.x 
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Fio-ure  10:  Stabilized  RLSFF:  Low  excitation  does  not  cause  instability 


Figure  11:  Stabilized  RLSFF,  aircraft  with  failure,  at  t  -  250  seconds 


would  consist  in  using  a  square-root  algorithm.  This 


was  not  found  to  be  necessary,  however. 


3.5  Auto  Pilot  Angle  Tracking 

....  o  q  that  could  be  used  to  control  the  angles  0,  </>, 

An  auto  pilot  was  described  in  section  2.3  that  couia 

^  of  the  airplane.  This  auto  pilot  was  implemented  with  the  batch  LS  identihcation  matrices, 

lor  demonstration  of  the  concept.  The  gain  in  (48)  and  (49)  was  chosen  to  be  p  =  1.6,  so  that 

1  1  H  ^t  1  25  +  7  1  56  If  the  reference  model  is  exactly  matched, 

the  closed-loop  poles  were  located  at  -1.25  ±J  ■  ■ 

,  ,  describes  the  relationship  from  each  angle  comirrand  to  the  output 

the  transfer  function  75.^2.55+4 

angle. 

The  auto  pilot  was  tested  by  hold.ng  two  of  the  arrgle  commands  constant,  while  making 
step  changes  in  the  third  command.  Figure  12  shows  some  of  the  results  from  the  tests.  Note  that 
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there  is  a  constant 


Figure  12:  Tracking  of  «,  and  t/>  by  the  auto  prlot 
command  for  «  of  4^561  degrees.  This  is  the  angle  of  attack. 


4  Conclusions 

We  had  three  nradn  goals  when  designing  a  reconfignrah.e  hight  control  system.  First,  we  wanted  a 

the  nominal  trim  values.  Second,  we  wanted  the  system  to  have  its  three  inputs  and  three  outputs 
decoupled,  fn  the  aircraft  w.thout  a  farlure,  th.s  is  easily  achieved,  because  of  symmetry.  Once 
a  failure  occurs,  the  aircraft  may  lose  its  symmetry,  makirtg  decoupUng  more  difficult.  Th.rd,  we 
needed  a  closed-loop  system  that  would  track  the  desired  output.  We  have  demonstrated  that  w.th 
model  reference  adaptive  control  it  ,s  feasible  to  achieve  ah  three  of  these  goals  with  sat.sfactory 

results. 

The  input  error  direct  algorithm  has  important  advantages  over  other  algorithms.  The 
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indirect  algorithm  requires  more  parameters  to  be  ideutihed.  45  versus  27.  Since  there  are  a  large 

direct  algorithm,  but  has  more  rigid  stabiUt,  conditions  than  either  the  indirect  or  input  error 
direct  algorithms.  Also,  least-squares  adaptation  algorithms  can  be  used  with  the  indirect  and  input 
error  direct  algorithms,  but  not  with  the  output  error  direct  algorithm.  Least-squares  algorithms 
have  faster  coiivergeiice  tor  large  iiuinbers  of  parameters  than  gradient  descent  algorithms,  ivhicl. 

is  critical  for  an  reconfiguration. 

The  recursive  leas, -squares  with  forgetting  factor  algorithm  is  often  used  in  adaptive  control. 
However,  the  algoritlim  suffers  from  instability  if  there  is  not  sufficient  excitatio 
Since  sitnations  with  httle  cacitation  are  coinmoii.  eg.  steady  level  flight,  this  is  a  serions  issue.  We 
have  investigated  a  new  algorithm  which  avoids  this  problem:  the  stabihzed  recursive  least-squares 
with  forgetting  factor  algorithm  (SRLSFF).  The  SRLSFF  achieves  stabibty  during  periods  of  low 

results  in  an  algorithm  with  relatively  weak  conditions  necessary  for  stabihty. 

There  are  many  related  issues  which  remalu  to  be  studied.  The  performance  of  these  algo¬ 
rithms  should  bo  tested  when  there  is  noise  present  in  the  system.  Cursor,  testing  has  shown  that 
the  identifleation  algorithms  were  not  siguiflcan.ly  afleeted  by  the  noise,  but  control  preformance 

testing,  both  with  other  failures,  aud  at  flight  conditions  throughout  the  operational  envelope  of 
the  aireraff.  Except  as  noted,  all  tests  were  conducted  near  a  single  operating  point.  At  other 
flight  conditions,  it  may  be  better  to  coiitro.  the  aircraft  based  on  other  criteria.  One  such  criterion 
would  be  control  of  the  aircraft  based  on  the  acceleration  experienced  by  the  pilot. 
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Technical  Notes  and  Correspondence. 


Multivariable  Adaptive  Control;  Identifiable 
Parameterizations  and  Parameter  Convergence 

Michel  dc  Mathelin  and  Marc  Bodson 

algorithm  for  aSrithm  requires  fewer  parameters  than 

the  parameters  converge  to  their  nominal  valu« 
of  exciLion  conditions.  Simple  frequency  d^omarn 
under  persistency  o  derived  to  satisfy  these  conditions. 


I.  Introduction  c 

The  extension  of  single-input  singIcKiutput  adaptive  control  resuks  t 
lollu  nablc  systems  has  been  considered  by 

cite  a  few  in  the  deterministic  framework:  Elliot  and  Wolov.ch  [1]  I 

TJ,  »d  N^dn,  121.  »d  Dus.,d  2,  21.  131.  Tl.c 

thei  papers  has  been  on  finding  parameterizations  for  the  controllers  . 

Sg  »»iv  <“  >  f""  “2  “ 

to  reduce  the  amount  of  d  priori  information  required. 

Sose  inspection  reveals  that  parameter 
anteed  for  schemes,  even  with  rich  input  signals,  because  y 
rel^n  parameterizations  that  do  not  define  the  par^eters  unique  y 
The  issue  is  the  dual  of  the  issue  of  identifiabiUty  in  parameter 
estimation.  A  parameterization  is  a 

evstems  which  associates  a  vector  of  parameters  to  each  system.  F 
l-mear  time  invariant  systems,  one  usually  says  that  a 
ic  identifiable  if  each  transfer  function  corresponds  to  a  unique 
paAr^Slr  vLr.  Similarly,  in  direct  adaptive  control,  the  controller 

I  parameterized  so  that  a  vector  i 

toTch  plant  Ihen,  one  may  say  that  the  parametenzauon  ts 

identifiable  if  the  controller  parameters  are  uniquely  defined  for  each 

‘’'Tn  this  paper,  we  present  a  direct  model  reference  adaptive  control 
fMRAO  alrorithm  for  multivariable  systems,  using  a  parameten- 

we  give  simple  frequency-domain  conditions  on  the 

to  guariitee  persistency  of  excitation  and.  hence,  parameter 

“u  "Sonable  to  ask  why  identifiable  parameterizations  would 
useful  in  adaptive  control.  The  first  reason  is  that  ^^en  nori|d 
fiable  parameterizations  are  used  in  adapuve 

persistency  of  excitation  on  the  regressor  vectors  cannot  be  satisfied 
Manascnpl  received  February  .3,  1992: 

rclerPorndito:  GrS  bS 

w^lhe  ?ep“  mlmof  Electrical  Engineering,  University  of 

Utah.  Salt  Lake  City.  UT  84112  USA. 

IEEE  Log  Number  9401675. 


in  general,  even  with  rich  signals,  so  that  the  par^eters  may  not 
converge  to  unique  values.  Since  the  set  to  w  ic  e  para 
converge  is  usually  unbounded,  the  convergence  of  the  parameter 
1  s^ogi.™  .0  Small  maaaua^M  noma 

unmodeled  dynamics  can  easily  force  convergence  the  Pa«n|«era 
to  regions  of  the  parameter  space  where  the  system 
Conirsely.  with  identifiable  parametenzations  and 
inputs,  a  certain  degree  of  robustness  to  no.se  and  unmode  ^ 
dynamics  is  always  guaranteed  and  the  parameters  will  remain  in  the 
neighborhood  of  their  nominal  values  (cf.  [4]).  An  example  prwented 
in  sLion  V  illustrates  the  difference  in  robustness  provided  by  an 
identifiable  parameterization. 

Another  advantage  of  identifiable  parametenzati^s  is  that  bey 
usually  require  a  smaller  number  of  parameters.  This  number  a  - 

Idy  tenl  to  be  large  for  multi-input/multi^utput  systems  and 

computational  requirements  grow  fast  with  the  number  of  parame^ 
ters  especially  for  least-squares  algoritlims.  On  the  other  hand, 
disadva^ge  is  that  the  observability  indices  of  the  plant  need 

known,  as  opposed  to  an  upper  bound, 

An  important  contribution  ol  this  paper  is  1.  c 
simple  frequency  domain  conditions  on  the  inputs  that  gu^antc 
parameter  convergence  to  the  nominal  values. 

Lre  obtained,  using  generalized  harmonic  analysis,  m  mult.vanable 
recursive  identification  by  de  Mathelin  and  Bodson  [5], 
input/single^utput  (SISO)  adaptive  control  by  Boyd  and  Sas^  [  ]• 
Hwever  the  conditions  in  multivariable  adaptive  control  are  no 
Lai  ex’tensions  of  the  conditions  in  the  SISO  -e.  ‘^Tn 
difference  with  the  SISO  case  is  that  the 

conditions  for  parameter  convergence  may  _  1  j 

parameter  convergence  may  depend  on  the  location  of 
components  of  the  inputs.  For  the  same  number  of  frequences  in  th 
inputs,  convergence  may  depend  on  the  locatmns 
This  is  not  the  case  with  SISO  systems  where  only  the  number  o 

frequencies  is  relevant. 


U.  Model  Reference  Control 
Assume  that  the  plant  is  of  order  u  and  is  described  by  a  square, 
stricn;  proper,  and  minimum  phase  funCmn 

matrix  P(s)  e  the  set  of  p  x  P  matrices  whose  element 

iS  LcUons  of  2.  U..  fo.lo«i.l!  cooooiw  sm,c™  « 
considered 


u  =  Cor  +  A  'C[u]-FA  '^D[yp 


■■  Mo(ro] 


where  ir  is  the  vector  of  inputs  of  the  system. 

reference  signals,  Co  G  is  nons.nguto,  ^ f  A 

A  r  1  ru't  D(<.1  G  the  set  of  p  X  p  matnees  whose 

«  ™  polyooilf  l»  -  r—  «"<4  A- 

transfer' function  matrix,  and  A(s)  is  a  Hnr-iU  dragona^ 

A(s)  =  diag  {A.(s)).  such  that  A^  D  is  proper  and  A  C  is  stnctl> 

^  ^  combining  the  equation  of  the  planL  Vp  = 
equation  of  the  controller  (1).  the  output  Vp  can  be  expressed 

yp  =  :Vr((A  -  C)Dr  -  DNR)~'^Co[r\  (2) 
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where  {Nr^s),  Da{s)}  is  a  right  matrix  fraction  description  (MFD) 

of  P{s),  i.e..  P  =  NrD[j^\  rr/  ^ 

Let  the  reference  model  M(s)  =  H{s)Mo{s),  where  ff(s)  ts  the 

Hcrmite  normal  form  of  P(s)  (see  [4]),  i.e.,  liirv,— «>  S  ^(s)F(s)  = 
Kp  (high-frequency  gain  matrix)  nonsingular  and 

__1 _  0  -  -  1 

t  Q 

/((s)  =  (.+a)--2-‘  (»+o)"2 

1 

■  (.+a)''F  J 

with  dh;j{s)  <  r,-  —  1 

where  a  is  aihitn..-/,  but  fixed  a  priori.  The  model  output,  y„,  is 
defined  as 

y„  =  HMo[ro]  =  F(r].  (3) 

Then,  it  can  be  shown  (cf.  Appendix)  that  if  5A;  >1^  —  1  and 
u  >  I'.n.x  (where  is  the  maximum  of  tne  observability  indices 

of  F(s)),  3Co  e  C'is),  D-{s)  e  solution  of  the 

Diophantine  equation 

Fk[(A  -C')Dn-  D'Na\-'KCo  =  H  W 

such  that  model  matching  is  achieved.  A-'  D*  is  proper,  and  ^  C 
is  strictly  proper.  In  particular,  Co  =  K,  nonsingular,  dD  < 
-  1.  and  3r.C-  <  dXi  (where  dr.C'  arc  the  row  degrees  of 
Cr  )*  An  important  result,  whose  proof  is  in  the  Appendix,  is  that  the 
matrices  Cl.C\D'  arc  unique  if  we  add  tlic  following  constraint 

on  tlie  solution 

dciD’  <vi-l  Vi  (5) 

whete  {i/.  }  arc  the  observability  indices  of  F(s)  and  Sc.-F*  are  the 
column  degrees  of  D  . 


III.  Adaptation 

The  matching  equality  (4)  is  equivalent  to 

/  =  CJF“‘F-1-A"‘C’ +  A"‘D'F.  (6) 

Define  L(s)  =  diag  {/(s)},  with  ((s)  Hurwta.  d^s)  >  d,  where  d  is 
the  maximum  degree  of  all  elements  of  H  (s).  Then,  muluplying 
both  sides  of  (6)  by  L"*  and  applying  both  transfer  function  matnccs 

to  u  leads  to 

If  H(s)  is  known  n  priori,  then  (7)  is  an  equation  where  the 
unknown  parameters  appear  linearly.  Indeed,  define  the  matrices 
C.*,---,C;_„  G  such  that 


The  matrix  of  unknown  controller  parameters,  0  — 

[Co  -  •  •  C;_i  -  -  -  Dl]  and  the  regressor  vector 

V/ = [(FL)-‘iypr(A^)-i«r— 

so  tliat  tlic  following  error  equation  can  be  derived  from  (7) 

e,  =  'I'  ® 


where  6  is  the  estimate  of  O'  and  d>  is  the  parameter  error.  At 
this  point,  e  may  be  estimated  using  a  standard  linear  estimation 
algorithm.  The  error  equation  is  equivalent  to  the  one  presented  in 
Elliot  and  Wolovich  [1]  and  included  in  Sastry  and  Bodson  [4].  It 
requires  the  a  priori  knowledge  of  the  Hcrmite  normal  form  n(s) 
and  an  upper  bound  r  on  the  observability  index  Then,  the 

number  of  parameters  to  identify,  Ne.  is  2p'^u.  Note  that,  for  stability 
considerations,  other  assumptions  may  be  needed  to  guarantee  that 
C^’  is  bounded  (cf.  [7]). 

To  guarantee  parameter  convergence  of  the  adaptive  scheme,  one 
finds  that  the  uniqueness  of  O"  must  be  guaranteed.  From  (5), 
the  observability  indices  {i/.}  must  be  known  and  6'  must  be 
constrained  so  that  daD'  <  vi  —  1  and  v  =  An  advantage 

of  uniqueness  is  that  the  number  of  parameters  Ne  becomes  smaller, 

Ne  =  p’l'max  +  Pti-  Note  also  that  Ne  =  2pn  if  the  observability 
indices  are  all  equal  and  that  Ne  =  2u  in  the  SISO  case,  as  expected. 

IV.  Parameter  Convergence 

Various  approaches  have  been  followed  to  guarantee  stability  of  the 
adaptive  scheme  and.  in  particular,  boundedness  of  all  the  signals.  The 
main  difficulty  wlicn  trying  to  prove  stability  is  that  the  estimate  Co 
must  slay  bounded  away  from  singularity.  The  simplest  proofs  assume 
that  initial  parameter  estimates  arc  available  that  arc  close  to  the 
nominal  values.  For  larger  initial  errors,  fixes  arc  required  to  deal  with 
the  singularity  regions  of  the  algorithm.  A  parameter  transformation 
to  avoid  singularity  regions  and  a  global  stability  proof  arc  presented 
in  dc  Mathclin  [7].  For  lack  of  space,  we  will  not  address  stability 
issues  here  and  proceed  to  study  convergence  properties  assuming 
that  stability  is  guaranteed. 

nie  condition  for  parameter  convergence  is  the  well-known  per¬ 
sistency  of  excitation  (PE)  condition.  It  states  that  if  the  regressor 
vector  t/>  is  PE.  then  Iim<_«  6  =  6’  (exponentially,  if  the  parameter 
estimation  algorithm  is  a  gradient  algorithm  or  a  recursive  least- 
squares  algorithm  with  covariance  resetting  or  fotgetting  factor).  The 
problem  with  such  a  condition  is  that  it  constrains  time  signals 
that  are  interrelated  and  not  freely  available  to  the  designer.  In 
particular,  the  condition  may  never  be  satisfied,  as  happens  when  the 
parameterization  is  not  identifiable.  Therefore,  parameter  convergence 
conditions  arc  not  of  any  real  use  unless  they  are  translated  into 
conditions  on  the  external  inputs  of  the  adaptive  system.  This  has 
been  done,  using  generalized  harmonic  analysis,  by  Boyd  and  Sastry 
[6]  in  the  SISO  case  and  by  de  Mathelin  and  Bodson  [5]  m  recursive 
multivariable  identification.  We  use  the  same  approach  here  to  obtain 
the  following  results.  Assume  that  the  identifiable  parameterizauon 
is  used.  . 

Theorem  I-Frequency  Domain  Conditions  for  Parameter  Conver- 
■.  gence-MRAC  with  Kp  Known:  Let  the  inputs  r;  be  stationary  and 
uncorrelated  (different  frequencies  in  each  input)  and  assume  that  the 
high-frequency  gain  matrix  Kp  =  C*"'  is  known,  so  that  the  control 
parameter  matrix  Co  does  not  need  to  be  estimated. 

1)  If,  for  all  i,  the  support  of  the  spectral  measure  of  the  rth  input, 
Sr.  (w),  contains  at  least  n  -f  -  p  different  values  of 
then  is  PE. 

2)  If  the  support  of  the  spectral  measure  of  the  inputs  taken 

altogether.  Sr{^)  does  not  contain  at  least  n  +  -  U 

different  values  of  lj,  then  rp  is  not  PE. 

Theorem  2— Frequency  Domain  Conditions  for  Parameter  Conver- 
1  gence-MRAC  with  Kp  Unknown:  Let  the  inputs  r;  be  stationary  and 
uncorrelated  (different  frequencies  in  each  input)  and  assume  that  t  e 
initial  parameter  error  is  sufficiency  small  that  Co  is  nonsingular  or 
that  a  parameter  transformation  is  used  to  avoid  singulanty  regions 
i)  (see  de  Mathelin  [7]). 
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TABLE  I 

Simulation  Results 


Case 

Inputs 

f.c. 

I 

ri  =  sin(2.0t) -f- sin(4.0t) 

4 

7^0 

r2  =  sin  (l.Ot)  4-  sin  (3.0t) 

4 

2 

ri  2=sin(2.0t)  +  sin(1.197S2411t) 

4 

0 

72  =  sin  (l.Ot)  +  sin  (3.0t) 

4 

3 

ri  r:sin(1.0<)  +  sin(3.0f) 

4 

0 

72  =  sin(2.0t)  + sin  (1.1978241  It) 

4 

4 

n  1.0  +  sin(2.0t) 

3 

0 

72  =  sin(1.0t)  +  sin(3.0t) 

4 

1)  If,  for  all  i,  the  support  of  the  spectral  measure  of  the  ith  input. 

Sr..(a/),  contains  at  least  n  +  -  p  +  1  different  values  of 

u,  then  i/i  is  PE. 

2)  If  the  support  of  the  spectral  measure  of  the  inputs  taken 

altogether,  SrCoj)  does  not  contain  at  least  ji +  different 

values  of  CJ,  then  t/i  is  not  PE. 

The  proof  of  Theorem  2  is  in  the  Appendix.  The  proof  of  Theorem 
1  is  a  trivial  simplification  of  the  proof  of  Tlieorem  2.  For  the 
interpretation  of  the  results,  it  should  be  noted  that  one  sinusoid, 
sin  (wot),  contributes  to  two  frequency  components,  at  +wq  and  —ojo- 
In  the  SISO  case,  the  sufficient  condition  for  parameter  convergence  is 
equal  to  the  necessary  condition.  As  expected  (cf.  [6]),  2n  frequency 
components  are  necessary  (with  unknown  kp).  Consequently,  the 
number  of  frequency  components  necessary  for  parameter  conver¬ 
gence  is  equal  to  the  number  of  parameters  Ne-  This  is  not  true 
for  multivariable  systems  where  fewer  frequencies  are  required  than 
the  number  of  parameters.  Generally,  in  tlie  multivariable  adaptive 
control  case,  the  sufficient  condition  is  different  from  the  necessary 
condition.  A  similar  result  was  found  in  the  identification  case  by  de 
Mathelin  and  Bodson  (5]  where  the  sufficient  condition  was  at  least 
n-bi/m,!  frequency  components  per  input  and  the  necessary  condition 
was  n  +pN„»x  frequency  components  in  the  inputs  taken  altogether. 
A  paiticulariy  interesting  fact  is  that  when  the  necessary  condition 
is  satisfied  but  the  sufficient  condition  is  not,  parameter  convergence 
occurs  in  certain  cases  and  not  in  others,  depending  on  the  location 
of  the  frequency  components  in  the  input.  This  peculiar  phenomenon 
proper  to  multivariable  systems  is  exposed  through  examples  in  the 
section  to  follow.  Furthermore,  it  can  also  be  shown  (see  [7])  that 
for  some  systems  and  for  some  particular  input  signals,  parameter 
convergence  occurs  in  the  identification  case  but  not  in  the  MRAC 
case  and  vice  versa.  It  is  also  interesting  to  note  that  there  is  no 
relationship  between  the  particular  frequencies  and  the  zeros  of  the 
system.  Indeed,  there  exists  an  infinite  number  of  different  input 
signals  for  which  this  particular  phenomenon  occurs,  and  it  occurs  in 
the  MRAC  case  for  different  signals  than  in  the  identification  case. 


V.  Examples 


A.  Example  I:  MRAC  Convergence 

This  example  illustrates  the  results  of  Theorem  2.  Let  the  plant 
be  a  two-input/two-output  system  of  order  four,  with  the  following 


transfer  function  matrix 


F(s) 


I 


»^+3»^-E3.+l 


The  system  is  stable  and  minimum  phase.  The  observability  indices 
are  i/i  =  2  and  1/7  =  2.  Let 

[(5  +  A)  0 

0  (^  +  •^) 


AW- 


(7  +  1) 

0 


0 

(f  +  i) 


M{s)  = 


(•+1) 

0 


(-+1)  J 

with  A  >  0  and  /  >  0.  Therefore,  the  unknown  parameter  matrix  is 
given  by  the  equation  at  the  bottom  of  the  page,  where  the  number 
of  unknown  parameter  Ne  =  1C.  The  regressor  vector  is  defined  by 

(s+l)!/,.,  (^+  I)yr2  "1 


rl’ 


I(s) 


1(3) 


Vp, 


/(s)(s  +  A) 

yp7 


/{s)(s  +  A)  l(s)(s  +  X) 


I(s)(s  -f  A) 

___  ^ 

1(3) 


yp2 

1(3) 


with  /(s)  =  j  +  1.  Finally,  tlie  sufficient  condition  for  parameter 
convergence  is  n  -f  Nmi.*  —  p  -f  1  —  5  f.c.  per  input  and  the  necessary' 
condition  is  ii  +  pr.n.x  =  8  f.c.  in  the  inputs  taken  altogether. 

To  illustrale  the  results,  it  is  convenient  to  compute  the  smallest 
eigenvalue  of  (cf.  proof  of  Tltcorcm  2).  This  number  is 

5^  0  if  i!’  is  PE  and  =0  otherwise.  Table  I  summarizes  the  results 
obtained  by  computing  A,/,„(0)  for  several  inputs.  Cases  1,  2,  and 
3,  with  four  frequency  components  (f.c.)  in  both  inputs  show  that 
when  the  necessary  condition  is  respected  but  tlie  sufficient  is  not, 
parameter  convergence  will  depend  on  the  location  of  the  f.c.  in  the 
input.  In  Case  1  there  is  parameter  convergence.  In  Cases  2  and  3 
the  parameters  do  not  converge  to  their  true  values.  It  was  found 
that  there  exists  an  infinite  number  of  values  for  the  f.c.  such  that 
convergence  does  not  occur,  so  that  those  values  arc  not  related  in 
any  obvious  manner  to  the  zeros  of  the  system.  Case  4  illustrates 
that  eight  f.c.  in  the  whole  input  are  absolutely  necessary  to  have 
convergence. 

We  also  simulated  the  behavior  of  the  MRAC  algorithm  under 
these  different  inputs.  To  increase  the  speed  of  convergence,  a  least- 
squares  algorithm  with  forgetting  factor  was  used  for  the  simulation. 
Fig.  I  shows  the  evolution  of  the  function  v(t)  =  /nlK^  —  ^')11  in 
Cases  1  and  2.  When  the  input  is  not  PE,  all  the  estimates  do  not 
converge  to  their  true  value,  and  limt  — *00  v(t)  =  u(oo)  ^  -00. 

8.  Example  2:  Robustness 

The  following  simulation  illustrates  the  fact  that  the  schemes 
using  identifiable  parameterizatidns  generally  have  better  robustness 
properties  than  those  using  nonidentifiable  parameterizations.  This 
example  is  one  of  several  simulations  that  we  ran,  adding  various 
types  of  noise  and  unmodeled  dynamics  to  the  ideal  systems.  In 
our  experiments,  we  found  several  cases  where  both  schemes  (using 
identifiable  and  nonidentifiable  parameterizations)  were  stable,  and 
several  cases  (as  the  one  presented  here)  where  only  the  scheme  based 
on  an  identifiable  parameterization  remained  stable.  The  reverse  was 
not  observed. 


1  2 
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A-2 

-1 
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A-i-  1 


A^  -  4A  +  3  2A^  -  5A  -P  4  3  -  A  3  -  2A 

1-A  A--2A-t-2  1  1-A 


Fig.  1-  v(t)  in  caic  1  and  2. 


Identifiable  Parameterization:  daD  <  vi  1-'  Assuming  that 
is  known  so  that  Cl  does  not  need  to  be  identified,  the  unknown 
parameter  matrix  is  given  by 


Fig.  2.  ll^oCOIl  identifiable  paramctcrizalion. 

Let  the  nominal  plant  be  a  iwo-input/two  output  system  of  order 
3,  with  the  following  transfer  function  matrix 


2.^4-3»— r 

D/  \  i^+3<ia+4.+2  7a+332+-4«d 

.+13 

-71+37t++i+2  »^+3.'^+<.H 
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where  the  number  of  unknown  parameters  Ng  —  10.  The  regressor 
vector  is  defined  by 

T  r  “1  U2  Vpi 

^  [l(s)(s+A)  /{s)(s  +  A)  l(s)(s  +  A) 

yp2  ypi 

l(s){s  +  X)  l{s)/ 

From  Theorem  1 .  the  necessary  condition  for  parameter  convergence 
is  three  f.c.  per  input  and  the  necessary  condition  is  five  f.c.  in  the 
inputs  taken  altogether. 

Nonidentifiable  Parameterization:  daD’  <  +  -  1  =  I'max  -  1- 
Thc  unknown  parameter  matrix  is  given  by 

+  (!■’' Ill  1  A -3  A -2  -1  -1] 

0  [^2  J 

where  A  i  and  kj  arc  arbitrary  constants.  Therefore.  Ng  -  12  and  the 
regressor  vector  is  defined  by 


The  system  is  stable  and  minimum  phase.  The  observability  indices 
afe  1/1=2  and  1^2  —  i* 

Af  1  R®  +  •^)  ®  1 

0  (a  +  a)J 


ivitli  A  >  0  and  /  >  0. 


A  simulation  of  tlic  MRAC  algorithm  was  made  for  both  param- 
ctcrizations  with  the  following  PE  reference  input;  n  =  sin  (2  * 
t)  +  sin  (4  .  I.)  and  rj  =  sin(t)  +  sin  (3  *  f)  and  with  unmodelcd 
dynamics  and  high-frequency  output  noise  added  to  the  nominal 
system.  The  unmodcled  dynamics  were  (144/ (s  -f  12)  )  on  the  first 
output  Vf,:  and  {289/(s  +  17)*)  on  the  second  output  yp^.  The 
noise  was  an  addiuve  output  noise  equal  to  0.1  sin  (20  ♦  t)  in  the 
first  output  and  equal  to  0.1  sin  (25  *  t)  in  the  second  output.  The 
unmodeled  dynamics  and  the  noise  are  more  than  a  decade  away 
from  the  dynamics  of  the  nominal  system  and  of  the  reference  model. 
Furthermore,  the  average  amplitude  of  the  noise  is  about  5%  of  the 
average  amplitude  of  the  reference  output.  For  the  simulations,  a 
least-squares  algorithm  with  forgetting  factor  was  used  to  increase 
the  speed  of  convergence.  Also,  a  stabilizing  term  was  added  to  the 
least-squares  algorithm,  sec  Kreisselmcier  [9],  to  prevent  divergence 
of  the  covariance  matrix  of  the  least-squares  algorithm  (which  can 
occur  with  the  nonidentifiable  parameterization).  The  parameters.  A 
and  I,  were  selected  equal  to  four  and  10.  and  the  initial  controller 
parameters  were  chosen  equal  to  zero.  The  norm  of  the  output 
error.  Hcoll,  is  shown  in  Fig.  2  for  the  identifiable  parametenzation 
and  in  Fig.  3  for  the  nonidentifiable  parameterization.  Clearly.  *e 
system  becomes  unstable  with  the  nonidentifiable  parametenzation 
and  remains  stable  with  the  identifiable  one  (much  longer  simulations 
were  run  to  verify  this  fact). 

Appendix 

Umma  I— Null  Matrix  Condition:  Given  {Na,  Dr},  a  nghl 
MFD  of  a  (p  X  rn)  strictly  proper  system  with  observability  indices 
{i/.}.  If  D(s)  6  ^Rs)  e  with  q  arbitrary,  are 

such  that 

D(s)Nn(s]  =  .V(s)Dfi(s)  -Vs  and  daD  <  u.  -  I 
then  D{s)  =  0  .V(.s-)  =  0. 
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nonsingular  and  BDl  =  r'mujc  (cf-  Kaila*  [10]).  Divide  A/vp  H 
on  the  right  by  Dl.  Then,  by  the  polynomial  matrix  division 
theorem  (see  [10,  pp.  388-390]),  3Q,R  £  such  that 

=  QDi,  +  jR  and  RDl'^  is  strictly  proper.  Furthermore, 

dciR  <  dciDL  <  t'max-  Then,  let 

D"  ^ -R  =  QDl- C'=h.-QNL 


It  is  easy  to  sec  that  the  given  CJ,  C*,i9*  solve  the  matching 
equality.  Furthermore,  since  dXi  —  i/  —  1,  A  ‘D  is  p'cper.  On 
the  other  hand 


-s. 


Fig.  3.  lleo(t)l|  with  nonidcnlifiablc  paramclcnzalion. 

Proof:  Define  D.(s)  and  A.(s)  as  the  itli  row  of  D{s)  and 
A'(s),  then  by  hypothesis 

D:{s)Nnis)  =  N,{s)Dn{s)  and 


limA-'C-=  lim(/-A7‘iir-V-A  ^ D' P) 

a— »oo  a  — oo 

=/-/=0 

so  that  A“'C*  is  strictly  proper  and  dr.C  <  9A.  -  1,  e.g., 
ar.C'  <  -2  if  3A.  =  - 1.  To  insure  the  uniqueness  of  the 

solution  to  the  Diophantine  equation,  we  choose  for  Dl  the  canonical 
left  MFD,  so  that  dc.DL  =  and  dc.D'  becomes  <  i/.  -  1.  Then, 
suppose  that  Co  +  ACo,  D'  +  AC,  D’  +  AD  is  another  solution 
to  the  matching  equality.  Tlien 

ACoir'  p  +  AC  +  ADP  =  0 


dcjD,  =  dD,j  <u,  -  I  Vj  Vi. 

Suppose  that  {A^l,  Dl)  is  the  canonical  left  MFD  (cf.  [9]),  i.e.. 
dcjDc  =  dTjDL  =  ‘'j  rc(D£.]  =  / 

DL{s)NRis)  =  Nl{s)Dr{s) 

where  FcIDlI  is  the  leading  column  coefficients  matrix  of  Dl-  Now. 
if  Uk  =  then  the  matrices  {Nl,  Dl}  defined  by 

jth  row  of  Dl  =  jth  row  of  Dl  Vj  /  k 
jth  row  of  A^Z  =  jth  row  of  Nl  Vj  /  k 
fcth  row  of  Dl  -  fcth  row  of  Dl  +  D, 
itth  row  of  Nl  =  A-ih  row  of  Nl  +  Ni 

form  another  left  MFD  (Vi)  with 

dcjDl  =  drjDl  =  ui  FclDIl  =  I 

DI{s)Nr{s)  =  NI{s)Dr{s). 

Given  the  uniqueness  of  the  canonical  left  MFD 

DI{s)  =  Dl{s)  ^  D.(s)  =  0  N,{s)  =  0  Vi 

^  D{s)^0  N{s)  =  0.  □ 

Model  Reference  Matching  Equality:  We  first  show  that  there  ex¬ 
ists  a  solution  to  the  matching  equality  (4) 

Nr[{R  -  C')Dr  -  D’Nnr^RC;  =  H 

^  ClH-'P=l-X-'C’-A-^D'P. 

Let  {A'l,  Dl}  be  a  left  coprime  MFD  of  P(s)  with  Dl  row 
reduced,  then  the  leading  row  coefficients  matrix  of  D/.,  F^D/,],  ts 


lim  ACoir'  P  =  ACo  Ap  =0  ACo  =  0 


=}.  ACDr  +  ADNr  =  0  and  9c.  AD  <  i/;  -  1 

bv  the  null  matrix  condition  (Lemma  1),  AC  =  AD  =  0  and  the 
solution  IS  unique. 

Proof  of  Theorem  2:  Define  the  transfer  function  matrix, 
as  the  transfer  function  between  ip  and  u.  and  define  the 
model  signals  ip„,  as  the  signals  ip  when  the  parameter  error  <^  =  0 

ip^=n^,^p-'n[r]  =  iu„Ar]-  (9) 

The  proof  follows  steps  similar  to  those  of  the  proof  of  Theorem  1 
in  de  Mathelin  and  Bodson  [5]  and  can  be  found  in  de  Mathehn  [7]. 
The  outline  of  the  proof  is  as  follows.  First,  it  is  shown  that  xp  PE  is 
equivalent  to  t&m  PE.  Then,  since  the  reference  input  r  is  assumed 
stationary  xp„,  PE  is  equivalent  to  Rq,„{0)  >  0,  where 
the  autocorrelation  of  xpm{t).  finally,  the  first  part  of  the  theorem  is 
proved  by  showing  that  R^„{0)  >  0  when  the  sufficient  condition 
is  verified  and  the  second  part  by  showing  that  R^^IO)  is  singular 
when  the  necessary  condition  is  not  verified.  Q 

References 

[1]  H.  Hliott  and  W.  Wolovich,  “A  parameter  adaptive  control  structure 
for  linear  multivariable  systems.”  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-27,  pp.  340-352,  1982. 

[2]  R.  P.  Singh  and  K.  S.  Narendra,  “Prior  information  m  the  design  ot 

multivariable  adaptive  conuollcrs,"  IEEE  Trans.  Automat.  Contr..  vol. 
AC-29,  pp.  1108-1111,  1984.  . 

[3]  L.  Dugard.  G.  C.  Goodwin,  and  C.  E.  De  Souza.  “Prior  knowledge  in 
model  reference  adaptive  control  of  multiinput  multioutput  systems, 
IEEE  Trans.  Automat.  Contr..  vol.  29,  pp.  761-764.  1984. 

[4]  S.  Sastry  and  M.  Bodson.  Adaptive  Control:  Stability.  Convergence,  an 
Robustness.  Englewood  Cliffs.  NJ:  Prentice-Hall,  1989. 

[5]  M.  de  Mathelin  and  M.  Bodson,  “Frequency  domain  condmons  for 
parameter  convergence  in  multivariable  recursive  identification.  An 
tomatica.  vol.  26.  pp.  757-767,  1990. 


(6]  S.  Boyd  and  S.  Sastry,  "Necessary  and  sufficient  conditions  for  parame¬ 
ter  convergence  in  adaptive  control,"  Automatica,  vol.  22,  pp.  629-639, 

1986.  .  . 

(7]  M.  dc  Mathelin,  “Multivariable  adaptive  control:  Reduced  pnor  mforma- 
tion,  convergence,  and  stability,”  Ph.D.  dissertation.  Elec.  Comp.  Eng. 
Dept.,  Carnegie  Mellon  Univ.,  Pittsburgh,  PA,  1993. 

(8]  G.  Kreisselmeier,  “Stabilized  least-squares  type  adaptive  identifiers, 
IEEE  Trans.  Automat.  Conlr.,  vol.  33,  pp.  306-310,  1990. 

{91  S.  Beghelli  and  R.  Guidorzi,  “A  new  input-output  canonical  form  for 
multivariable  system,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-21,  pp. 
692-696.  1976. 

(10]  T.  Kailath,  linear  Systems.  Englewood  Cliffs,  NJ:  Prentice-Hall.  1980. 


CondiUonally  Minimax  Algorithm  for 
Nonlinear  System  State  Estimation 

A.  R.  Pankov  and  A.  V.  Bosov 


Abstract — In  this  note,  a  method  of  conditionally  minimax  nonlinear 
filtering  (CMNF)  of  processes  in  nonlinear  stocliastic  discrete-time  con¬ 
trolled  systems  is  proposed.  The  CMNF  is  derived  by  means  of  local 
nonparametric  opamizaUon  of  the  filtering  process  given  the  class  of 
admissible  filters  Sufficient  conditions  for  the  existence  of  the  CMNf 
are  considered,  and  the  properties  of  CMNF  estimates  arc  investigated. 
Results  of  the  CMNF  application  to  control  and  identification  problems 
arc  presented. 


I.  Introduction 

The  problem  of  nonlinear  system  state  estimation  is  a  very  impor¬ 
tant  part  of  the  general  control  problem  for  systems  with  incomplete 
data.  It  is  well  known  that  the  nonlinear  filtering  problem  involves 
a  stochastic  functional  equation  for  the  conditional  distribution  of 
the  system  state  given  all  observations  [1],  [2].  So.  the  estimation 
problem  in  the  general  case  b  an  infinite-dimensional  one,  and  its 
exact  solution  is  practically  impossible. 

Recently,  various  authors  tried  to  determine  some  properties  of  the 
nonlinear  system  that,  when  fulfilled,  provide  finite  dimensionality  of 
the  optimal  estimator  [3]-{8].  Investigations  show  that  the  situation 
when  the  optimal  nonlinear  filter  is  finite  dimensional  is  not  typical, 
and  even  in  some  very  simple  cases  may  not  be  the  case  [9].  Hence, 
we  support  the  idea  of  obtaining  the  finite-dimensional  estimators  by 
means  of  appropriate  approximations  of  the  optimal  nonlinear  infinite- 
dimettsional  estimation  algorithms.  Some  of  these  approximations  are 
well  known  and  widely  used,  such  as  the  linearized  and  the  extended 
Kalman  filters  (EKF),  the  second  order  filter,  and  many  others  {8), 
[lOJ.  Their  practical  utilization  shows  that  they,  mostly  being  only 
suboptimal,  have  some  undesirable  properties,  e.g.,  to  provide  biased 
and  even  divergent  estimates  [10],  [1 1].  So,  it  is  important  to  continue 
the  efforts  to  obtain  the  nonlinear  filters  which  are  finite  dimensional 
and  provide  estimates  with  improved  properties. 

In  this  note,  we  present  a  general  approach  for  finite-dimensional 
nonlinear  filtering.  It  is  based  on  the  idea  of  local  conditional 
nonparametric  optimization  of  the  filter  structure  given  the  class 
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of  admissible  filters.  Locally  optimal  filters  were  considered  by 
Pugachev  [12],  [13],  who  derived  the  conditionally  optimal  filter 
by  parametric  optimization  of  a  nonlinear  filter.  To  realize  the  last 
method,  one  needs  to  know  the  joint  characteristic  function  of  the 
system  state  and  the  filter  estimate.  This  function  can  be  obtained 
by  solving  the  special  nonrandom  functional  equation,  which  is 
rather  complicated,  and  it  takes  much  effort  to  obtain  the  desired 
solution,  especially  for  the  multidimensional  case.  Using  the  idea  of 
conditionally  optimal  filtering,  we  derive  a  conditionally  minimax 
nonlinear  filter  (CMNFj,  which  can  be  determined  by  a  priori 
computer  modeling  [14],  [15]. 

To  test  the  properties  of  the  obtained  algorithm  we  compare  the 
results  of  the  application  of  the  CMNr,  of  the  optimal  nonlinear  filter 
(ONF).  and  of  the  EKF  to  control  and  identification  problems.  Some 
simulation  results  arc  presented  in  this  note. 

II.  CMNF  Algorithm 

Assume  the  following  notations;  is  tlie  expectation  of  x, 

cov  {x,  y}  =  £{(x  -  E{x])iy  -  E{y})’'}  is  the  covariance  of  x 
and  y;  P(m;  S)  is  a  set  of  random  vectors  i  (probability  distributions 
Ei)  with  E{x)  =  m,  cov  {x,  x}  =  S;  A"*"  is  die  pseudoinverse  of 
A:  A  >  0  if  A  =  A^  and  is  positive  semidefinite;  A  <  D  if 
D  -  A>0:  ||rl|iv  =  (x^lRx)'^^  for  some  weight  matrix  W  >  0; 
||x|l  =  l|x||iv  for  W  =  I:  and  col(xi,  -  -  - ,  x„)  =  (xf,  -  -  ■ ,  xl}^; 
g'  £  argmin^gc /(s)  'f  fis')  =  iofsGC /C?)- 

Consider  tlic  following  discrete-time  dynamic  model 

fy„  =  V5„(y„_i,  u„,  u;„),  71  =  1,2,---;  yo  =  y, 

\z„  —  v„) 

where  y„  6  R"  is  a  state  vector,  {u„}  arc  the  independent 

white  noises.  tu„  £  R".  v„  £  R";  ij  €  R"  is  independent  of 
random  veaor,  z„  £  R*'  is  a  vector  of  observations; 
<p„(y,  u,  w)  and  tp„{y.,  v)  are  the  known  nonlinear  functions;  and 
_  u„(Z""‘)  £  R'  is  some  feedback  control,  where  Z"  = 

col  [Zn  ^"**»Zl}-  ^  ^ 

Let  us  consider  the  CMNF  for  the  process  {yn}  given  Z  .  Let 
^„(y,  u)  and  Cn(y,  z)  be  some  fixed  nonlinear  vector  functions  (the 
choice  of  these  functions  is  considered  in  Section  IV).  and  y„-i 
be  the  CMNF-cstimatc  of  y„-i  given  Z"  *.  Our  objective  is  to 
obtain  y„  using  y„-i,  z„  and  the  structural  functions  („(-),  Cn(-)  of 
CMNF  introduced  above.  Let  y„  =  (nCyn-i,  «n)  £  R^  be  the  basic 
prediction  for  y^  given  Z"  ~  * .  The  conditionally  minimax  prediction 
y„  €  R**  for  y„  is 

yn  =  a’„{yj  (2) 

where 

a*(-)£aigmin  max  P{lly«  -  «(yn)litv}  0) 
«()6/t  XeP(.mx,Sx) 

and  X  =  col(y„,y„)  €  P{mx,Sx),  A 
{«(-):  £:{|lQ(y„)llw}  <  oo}.  i.e.,  A  is  a  set  of  functions 

a(-);  R'  —  R'’  such  that  jE;{|la(y„)llw}  is  finite. 

Remark:  From  (2),  (3)  it  follows  that  y„  is  the  best,  in  a  minimax 
sense,  estimate  of  y„  given  the  “observation”  y„,  and  it  is  obtained 
under  the  assumption  that  the  only  known  characteristics  of  X  — 
col(y„,y„)  are  mx  and  Sx-  Note  that  the  mean-square  optimal 
estimate  of  y„  given  y„  is  y'  =  £{y,7  |  y„}  and  to  calculate  y„ 
we  need  to  derive  the  exact  distribution  of  X.  The  last  problem  is 
equivalent  to  the  initial  optimal  filtering  one. 
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Abstract-Ao  example  is  presented  of  an  adapdve  system  Urnt  b^om« 
unstable  when  a  multitone  reference  ‘ 

Stable  when  the  spectral  components  are  appUed  separately.  It  is  fur^e 
shown  that  another  scheme,  based  on  an  input  error  Itlrt^d  of  an  output 
^rror  in  the  adaptation,  does  not  exhibit  the  ^e  .lability  mechan^ 

The  technique  of  averaging  is  used  to  analyze  this  phenomenon.  The 
equilibrium  points  of  the  averaged  system,  railed  the  timed  va-es,  ^ 
fovestigated.  Results  are  given  on  the  number  of  tunrf  values,  their 
locatioL,  and  their  stability.  The  analysis  is  not  only  valid  for  a  speafic 
plant,  but  for  a  large  class  of  systems. 

I.  Introduction 

Robust  adaptive  control  has  been  a  major  area  of  research  in  recent 
years-  Indeed,  the  ability  of  a  conuol  system  to  maintain  stability 
in  the  presence  of  unmodclcd  dynamics  and  measurement  "O'se  is 
a  critical  consideration  in  practical  applications.  In  (6)  and  [1  ]. 

,t  was  shown  that  adaptive  algorithms  discussed  in  the  literature 
at  the  time  could  be  destabilized  by  relatively  minor  perturbauons. 

A  significant  research  interest  followed  to  improve  the  robustness 
of  adaptive  control  systems.  Modifications  of  the  update  laws  were 
developed  such  as  the  dcadzonc,  relative  dcadzone.  and  leakage  (see 
c  g  the  survey  of  [12]).  These  modifications  were  primanly  designed 
to  (Prevent  the  slow  drift  instability  that  occurs  when  inputs  are  not 
sufficiently  rich  and  outputs  arc  perturbed  by  measurement  noise. 

While  the  update  law  modifications  arc  essential  to  the  robustness 
of  adaptive  systems,  other  factors  are  also  important.  We  co^ntrate 
here  on  the  choice  of  adaptation  mechanism,  and  show  that  the  choice 
of  error  equation  can  lead  to  significant  differences  in  robustness 
properties.  The  starting  point  of  the  paper  is  a  new  ex^plc,  based 
on  the  example  of  [14].  A  case  of  multitone  instability  ts  presented, 
where  the  adaptive  system  becomes  unstable  for  a  sum  of  sinu¬ 
soids  even  though  it  is  stable  for  the  spectral  components  applied 
separately.  To  study  the  example,  we  apply  averaging  theory,  and 
study  the  equilibrium  points  of  the  averaged  system,  callrf  the  tuned 
values.  We  also  analyze  another  scheme,  based  on  an  input  error.  We 
show  that  the  adaptive  system  docs  not  become  unstable  with  the 
same  multitone  inpuL  and  we  support,  with  the  averaging  analysjL 
the  claim  that  the  scheme  is  intrinsically  more  robust  to  unmodcl^ 
dynamics.  Further,  we  explore  an  interesting  tuning  property  of  the 

Throughout  the  paper,  we  denote  by  ii(s)  the  Laplace  .transform 
of  a  signal  u(t).  P(s)  denotes  the  transfer  function  of  a  linear  ttme- 
invariant  operator  and  P[u(f)l  denotes  the  output  of  the  operator  wtffi 
input  ii{t).  P[corl  denotes  the  output  of  P(s)  with  input  co(  )  -  r{ 
(multiplied  in  the  time  domain).  Re[-]  and  Im[-]  denote  te  rea 
and  imaginary  part  of  a  complex  number,  respect.vely.  OE  ts  an 
abbreviation  for  output  error  and  IE  for  input  error. 
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II.  The  Rohrs  Examples  and  Multitone  Instabilities 

Wc  start  from  an  example  due  to  Rohrs  et  al.  [14].  A  model 
reference  adaptive  controller  is  designed  assuming  that  the  plant  is  a 
first-order,  linear  time-invariant  system 


Hs) 


s  +  ap 


u(s) 


(2.1) 


where  kp  >  0  and  ap  are  unknown.  Nominally,  kp  —  2  and  Op  —  I. 
The  control  law  is  given  by 


u{t)  —  Co{t)r{t)  -t-  do{t)yp{t).  (2-2) 


For  some  nominal  values  of  the  adaptive  parameters  cj  =  k,r,kp  *, 
=  (op  -a„ the  closed-loop  transfer  function  is  equal  to  the 
transfer  function  of  the  reference  model 


k.„  _  3  _  y„.is) 

^  ~  5  -E  n™  -f  3  f{s) 

The  adaptive  rules  for  co(t),  do{t)  arc  given  by 

cc  —  —gear,  do  —  —gcaVp 


(2.3) 


(2.4) 


where  5  >  0  is  called  the  adaptation  gain.  A  Lyapunov  analysis  leads 
to  the  conclusion  (cf.  (llj.  (ISj)  that  all  the  states  of  tlie  adaptive 
system  are  bounded  and  that  eo(t)  — *  0  as  t  — »  oo. 

Rohrs  etaL  [14]  observed  tJiat  this  adaptive  system  could  be  easily 
destabilized  with  the  addition  of  noise  and  unmodeled  dynamics. 
The  unmodclcd  dynamics  under  consideration  were  of  the  form  of 
muluplicative  linear  dynamics,  so  that  the  true  plant  was  given  by 


P(s) 


2  229 

7TT  +  30s  +  229 


(2.5) 


instead  of  (2.1).  A  consideration  was  that  the  added  poles  were  well- 
damped.  stable,  and  far  in  the  left-half  plane,  i.e.,  that  the  unmodeled 
dynamics  were  “mild”  by  classical  standards.  The  instabilities  de¬ 
scribed  by  Rohre  et  aL  were  studied  in  detail  by  Astrom  [2]  (a 
full  discussion  is  also  available  in  [15]).  A  main  contribution  of 
the  examples  was  to  show  what  can  go  wrong  when  insufficient 
or  improper  excitation  is  provided.  However,  it  has  often  been 
deduced  from  these  examples  that  the  more  excitation  there  is  in 
the  desirable  frequency  range,  the  better  things  are.  The  reality  is 
nwrc  complex.  Indeed,  consider  the  adaptive  system  with  the  plant 
(2.5)  and  no  measurement  noise.  For  a  single  tone  at  tu  =  1  rad/s, 
simulations  show  that  the  adaptive  system  is  stable,  and  co(f)  -» 1.69. 

-1.26.  For  tu  =  10  rad/s,  the  adaptive  system  is  also  stable, 
and  co(/)  -*1.04,  do{t)  -*  -7.31.  However,  if  we  let  r(t)  be  a 
multitone  signal 


r(t)  =  sin(t) +0.3sin(10i)  (2-6) 

the  adaptive  system  becomes  unstable.  The  parameter  responses  for 
the  single  tone  and  multitone  signals  are  shown  in  Fig.  1.  The  initial 
conditions  used  in  the  simulations  are  those  of  [14],  that  is,  all  zero 
except  co(0)  =1.14.  do(0)  =  -0.65.  The  adaptation  gain  is  p  =  1. 

Note  that  there  is  plenty  of  excitation  in  the  reference  input  signal, 
and  most  of  it  is  in  the  desirable  frequency  range.  In  addition,  there  is 
no  measurement  noise.  Tliercfore,  the  instability  cannot  be  explained 
by  the  same  arguments  as  for  tltc  Rohrs  examples.  The  example  also 


Co 

Fig.  1-  Mullitonc  instabilities.  do{co). 


contradicts  the  notion  of  “good”  and  bad  frequencies.  For  simpler 
adaptive  systems  (cf.  Uic  Ricdlc  and  Kokotovic  example  discussed 
in  Section  V),  some  frequencies  cause  instabilities,  others  do  not. 
but  their  effect  is  cumulative.  As  long  as  there  arc  more  good 
frequencies  than  “bad"  ones,  the  adaptive  system  remains  stable. 
Here,  the  addition  of  a  sin(t)  signal  to  sin(lOt)  (both  of  these  leading 
separately  to  stable  systems)  leads  to  instability.  This  is  a  highly 
nonlinear  effect,  absent  in  linear  control  systems.  We  might  have 
hoped  for  some  form  of  convexity;  that  adding  two  sinusoids  would 
lead  to  a  behavior  of  the  adaptive  system  intermediate  between  the 
responses  for  the  separate  components.  Such  is  not  the  case. 


III.  Input  Error  Versus  Output  Error 
Model  Reference  Adaptive  Control 
The  model  reference  adaptive  control  problem  can  be  solved  using 
several  approaches,  including  direct  and  indirect  techniques.  These 
approaches  have  similar  properties.  esUblishcd  analytically,  although 
there  are  some  differences  in  assumpuons  and  in  computations.  There 
are  two  main  approaches  to  the  model  reference  adaptive  control 
problem  with  a  direct  algorithm:  the  output  error  (OE)  approach,  aijd 
the  input  error  (/£)  approach.  The  OE  approach  is  the  one  used  in 
the  Rohrs  examples  and  is  developed  in  [11].  The  IE  approach  was 
introduced  in  [8]  in  discrete  time,  and  is  the  basis  of  the  scheme  of 
[7].  It  is  discussed  extensively  in  continuous  time  in  [15].  For  the 
example  considered  in  Section  II,  the  error  driving  the  algorithm  is 
given  by 

62  =  coj/p  +  doM[yp]  —  M[u\ 

with  the  adaptation  law 

Co  =  —gc2yp,  do  ~  —gc2M{yp\. 

The  scheme  is  a  simplified  version  of  the  general  scheme  discussed 
in  [15]  (in  the  notation  of  [15].  the  special  choice  of  L  was  made 
such  that  LM  =  1).  Also,  two  adjustments  to  the  scheme  that  are 
needed  to  prove  stability  have  been  omitted;  the  normalization  factor 
in  the  update  law,  and  the  projection  of  co  to  prevent  co  =  0. 

Consider  now  the  example  of  Section  11,  that  is,  the  Rohrs  example 
with  a  multitone  input  and  with  the  input  error  scheme  replacing  the 
output  error  algoritlim.  Fig.  2  shows  the  responses  of  the  adaptive 


(3.1) 

(3.2) 
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Fig.  2.  co(«)  (a)  OE,  r  =  sin(0  +  0.3sin(10I),  (b)  IF 

r  =  sin(t)  +  0.3sm(10O.  W  IE.  r  =  s;n(0. 


parameters  vary  slowly  compared  lo  the  other  states  of  the  system. 
The  question  of  stability  is  then  divided  into  two  parts,  the  inner- 
loop  stability,  i.e..  the  stability  of  the  closed-loop  system  with  fixed 
controller  parameters,  or  the  stability  of  the  matrix  A{x).  and  the 
outer-loop  stability,  i.e..  the  stability  of  the  adaptation  iLself,  or  the 
stability  of  the  nonlinear  averaged  system. 

This  paper  concentrates  on  the  question  of  outer-loop  stability.  An 
interesting  example  of  the  use  of  averaging  analysis  to  guarantee 

inner-loop  stability  is  found  in  the  work  of  [5],  in  the  context  of 

adaptive  echo  cancellation.  As  for  outer-loop  stability,  averaging 
allows  us  to  make  precise  the  notion  of  tuned  values,  first  introduced 
in  t9]  and  discussed  in  the  context  of  averaging  in  [41.  Assuming  that 
the  average  exists,  the  tuned  values  are  defined  as  the  equilibrium 
points  of  the  averaged  system,  i.e..  the  solutions  of  =  0. 

A  tuned  value  is  said  to  be  stable  if  the  equilibrium  is  locally 
asymptotically  stable. 


parameter  co  for  the  single-tone  input  and  for  the  multitone  input. 
The  response  of  do  is  similar  and  is  omitted  for  brevity.  Again,  the 
initial  conditions  are  all  zero,  except  co(0)  =1.14,  do(0)  =  -0,65. 
The  gain  5  =  1-  The  input  error  scheme  is  not  destabilized  by  the 
multitone  inpuL  For  single-tone  inputs,  the  parameters  converge  to 
the  same  values  as  tlie  output  error  scheme,  i.e..  the  values  that 
achieve  model  matching.  In  the  presence  of  the  multitone  inpuL  the 
parameters  converge  to  values  close  to  those  obtained  for  the  sin(t) 
inpuL  a  clearly  desirable  feature  since  this  is  tlie  dominant  component. 

The  averaging  analysis  of  Sections  V  and  VI  will  confirm  that 
the  different  responses  of  the  algorithms  reflect  intrinsic  differences 
of  robustness  and  not  a  peculiar  choice  of  example.  Tlie  analysis 
will  also  allow  us  to  answer  questions  tliat  naturally  arise,  such  as. 
Toward  which  values  do  the  parameters  of  the  IE  scheme  converge 
in  the  presence  of  multitone  inputs?  Does  the  convergence  depend 
on  the  initial  conditions?  Why  is  the  OE  scheme  unstable? 

IV.  AVERAGING  Theory  and  Tuned  Values 
Averaging  is  a  method  of  analysis  for  differential  equations  of  the 
form 

i  =Z  x),  x(0)  =  xo.  (Al) 


V.  Averaging  Analysis— Feedeorward  Gain  Alone 
In  this  section,  we  apply  the  averaging  theory  to  the  analysis 
of  the  adaptive  schemes  of  Sections  11  and  Ill,  but  with  only  the 
feedforward  gain  co  being  updated.  This  intermediate  step  will  be 
helpful  to  interpret  the  later  results.  While  the  analysis  is  restricted  to 
the  specific  adaptive  scheme  with  one  parameter,  there  is  considerable 
generality  in  the  results,  as  the  plant  and  reference  model  are  taken 
to  be  arbitrary  transfer  functions.  Tlie  adaptive  system  with  one 
parameter  co  was  considered  by  Riedle  and  Kokotovic  [13]  and 
further  studied  in  [3].  The  equations  for  the  adaptive  systems  are 
the  same  as  before,  except  that  do  =  0.  For  the  subiliiy  and  for  the 
averaging  analysis,  the  following  assumptions  are  made. 

(Al)  The  plant  transfer  function  P{s)  is  stable.  (A2)  Tlic  input  r 
is  given  by  r  =  sin(ui,t)  with  r;  yf  0. 

Assumpuons  (Al)  and  (A2)  are  sufficient  to  guarantee  the  existence 
of  the  averaged  system.  Assumption  (A2)  can  be  slightly  relaxed,  but 
this  form  will  simplify  the  presentation. 

Proposition  5. 1 — Averaged  Systems  for  the  OE  and  IE  Schemes:  The 
averaged  system  for  the  output  error  scheme  is  given  by 

c..  =  _£^r?(RelP(j-w.)]c...  -  RelM(jw,)l)  (5.1) 


The  main  assumptions  arc  that  e>  0  is  small  and  that  the  limit  and  for  the  input  error  scheme 

x)  dr  (4.2)  d..  =  -f  cLEr?(|Fa-.)f  -  Re[M(rw.)F* (>.)]).  (5.2) 

i=l 


/av{x) 


exists  uniformly  in  to  and  t.  The  averaged  system  is  then  defined  by 
i.v  =  e/av(x.v),  Xav(O)  =  XQ.  (4.3) 

Given  some  smoothness  assumptions  on  /  and  /av,  it  can  be  shown 
thaL  for  f  sufficiently  small,  the  trajectories  x(f)  and  x.v(t)  are 
arbitrarily  close  as  f  -*  0  on  intervals  [0,  T/e],  where  T  is  arbitrary. 
Further,  if  x.v  =  x+  is  an  exponentially  stable  equilibrium  point  of 
the  averaged  system,  the  trajectories  of  the  original  system  locally 
converge  to  a  ball  around  x''",  whose  size  shrinks  to  zero  as  e  — •  0. 
The  theory  can  also  be  extended  to  so-called  two-time  scale  systems, 
of  the  form 

i  =  e/(t,  X,  y) 

y  =  A{x)y  -F  h{t,  x).  (■^•5) 

Equations  (4.4)-(4.5)  are  especially  natural  in  adaptive  control,  where 
X  is  the  adaptive  parameter  vector  and  y  is  the  vector  of  plant  and 
controller  states.  Averaging  has  been  applied  in  this  context  by  several 
authors  (among  others,  [1],  [10],  [15]).  The  assumption  of  e  small  is 
equivalent  to  tlie  assumption  of  slow  adaptation,  i.e.,  that  the  adaptive 


The  proof  is  given  in  [3]. 

Proposition  5.2— Tuned  Values  for  the  OE  Scheme:  Assume  that 
'Zr-pl  Re[F(yw.)]  /  0.  Then,  the  averaged  system  for  the  output 
error  scheme  has  a  unique  tuned  value: 

+  _  E?=i''?MA^(M)]  (5  3) 

“  EC^MF(M)1 

which  is  stable  if  and  only  if  52"=i''?  Re[F[jw,)]  >0. 

Comments:  Proposition  5.2  follows  directly  from  Proposition  5.1 
and  is  essentially  the  result  of  Riedle  and  Kokotovic  [13].  The 
authors  also  showed,  through  simulations,  that  the  stability  criterion 
establishes  a  tight  stability-instability  boundary  for  the  onginal 
adaptive  system.  The  condition  is  satisfied  if  F(s)  is  strictly  positive 
real,  which  requires  Re[F(iw)l  >  0  for  all  ui  >  0.  However,  the 
condition  is  weaker  in  the  sense  that  the  transfer  function  must 
have  a  positive  real  part  only  at  the  frequencies  of  excitation  and, 
further,  some  “undesirable”  frequencies  can  be  tolerated  as  long  as 
the  contributions  of  the  frequencies  in  the  positive  region  exceed 
those  in  the  negative  region. 


Proposition  5.3 — Tuned  Values  for  the  IE  Scheme:  Assume  tlial 
\P(JuJi)\  0  for  some  i.  Then,  aside  from  cj,.  =  0,  the  averaged 
system  for  the  input  error  scheme  has  a  unique  tuned  value  given  by 

+  ^  E"=i Rc[M0w,)F'0T/.)]  ^5 

Er=.'-?l^(i=^0P 

which  is  always  stable.  Funher,  minimizes  the  averaged  squared 
error. 

Comments:  Projwsition  5.3  follows  directly  from  Proposition  5.1. 
While  the  output  error  scheme  can  be  destabilized  by  a  high- 
frequency  input,  such  is  not  the  case  for  the  input  error  scheme. 
In  fact,  the  surprising  result  is  that  no  frequency  or  combination  of 
frequencies  can  destabilize  the  scheme.  The  result  is  independent 
of  the  specific  transfer  function  P{s)  and,  therefore,  shows  that  the 
scheme  is  intrinsically  more  robust.  Perhaps  the  most  intriguing  part 
of  the  results  is  the  fact  that  the  tuned  value  for  the  input  error  .scheme 
minimizes  the  averaged  squared  error,  although  the  output  error  does 
not  appear  explicitly  in  the  adaptation  law.  It  is  often  believed  that  the 
fact  that  the  output  error  is  used  to  update  the  parameter  in  the  output 
error  scheme  is  a  guarantee  that  the  error  would  be  minimized.  Tliis  is 
not  true,  however,  as  the  scheme  can,  in  fact,  be  destabilized,  leading 
to  unbounded  errors.  On  the  otlier  hand,  the  input  error  scheme  always 
converges  to  a  neighborhood  of  the  parameter  value  for  which  the 
error  is  minimized, 

VI.  Averaging  Analysis— Feedeorward  and  Feedback  Gains 

For  the  averaging  analysis,  we  make  the  following  assumptions. 

(Al)  Along  the  trajectories  of  tlic  averaged  system,  the  closed- 
loop  transfer  function  P(s)/(1  —  doP{s))  is  stable.  (A2)  Tlie  input 
r  is  given  by  r  =  E"=i ^  ^ 

Rc(M(jw.)]  >  0,  Im(M0u.',  )l  <  0  for  all  i. 

Assumption  (Al)  is  hard  to  guarantee  a  priori  and  is  usually 
checked  a  posteriori,  the  stability  of  a  tuned  value  will  guarantee  the 
local  subility  of  the  adaptive  system  if  the  tuned  value  coaesponds 
to  a  stable  inner  loop.  Assumption  (A2)  restricts  the  class  of  inputs 
so  that  the  averaging  analysis  is  possible.  The  conditions  r;  5^  0 
and  w;  /  0  imply  that  the  input  is  sufficiently  rich  to  guarantee 
cxpioncntial  parameter  convergence  in  the  ideal  case  (no  unmodeled 
dynamics).  Assumption  (A3)  is  not  really  a  new  assumption  since 
M(.s)  is  given  by  (2.3).  However,  the  results  are  valid  for  arbitrary 
M{s)  provided  that  the  function  satisfies  (A3). 

Theorem  6.1 — Averaged  Systems  for  the  OE  atul  IE  Schemes:  The 
averaged  system  for  the  output  error  scheme  is  given  by 


Re[F(i'.u.)] 


Re[P(ju},)M{ju!,)] 

\P{j^.)f  RclMO-cu,)]^ 


^RejP(jai,  )-A/'(jw,  )]  j  1,0  c,,. 

A.(dav)  =  (1  -  Rc[P(j-^\)]f  +  (rGv  lmlP(jtu.)])^  (6.4) 

For  the  input  error  scheme,  (6.1)  also  holds,  but  with 

s  —  \  P(  ■  f  ^  Re[i'^(i‘^t)l  A 


-  VIWO-.)^  RelP0--.)l  j  ^  ’  V  0  eL 


A,(d,,.)  =  (1  -  Re[P{ia.v)])''  +  (d...  ImlPOTi.)])^  (6.7) 

Comments:  The  proof  of  the  theorem  is  in  the  Appendix.  The 
averaged  systems  have  been  written  in  a  common  and  compact  form 
that  exhibits  the  similarities  with  the  one-parameter  case.  However, 
the  extension  from  one  to  two  parameters  has  considerably  increased 
the  complexity.  The  main  reason  is  that  feedback  has  made  the  system 
nonlinear.  The  term  A,(r/.v)  is  a  polynomial  of  degree  2  that  is 
always  positive  and  is  the  square  of  the  closed-loop  characteristic 
polynomial  evaluated  at  *■  =  jiu,.  A  similar  term  is  found  in  the 
n-parameter  case  without  unmodeled  dynamics  (15],  The  matrix 
C(c.v,  d.v)  is  another  nonlinear  teuu,  but  since  it  is  nonsingular 
for  c,v  0,  is  has  no  effect  on  the  location  of  the  tuned  values. 

For  the  rest,  most  of  the  difference  between  the  OE  scheme  and  the 
IE  scheme  is  in  the  matrices  A;.  In  the  IE  scheme,  the  matrices  A. 
ae  symmetric,  and  they  arc  positive  definite  as  long  as  \P(iu};)\  0 

(semi-definile  otherwise).  This  is  reminiscent  of  Section  V,  where  the 
coefficient  of  C;,,.  was  |P(ju,',  )|’^.  On  the  other  hand,  the  coefficient 
for  the  OE  .scheme  in  the  scalar  case  was  Re[P(yT.’,)J,  which  could 
be  negative  in  the  presence  of  unmodeled  dynamics.  In  the  two- 
parameter  case  of  the  OE  scheme,  the  matrices  .4.  arc  generally 
neither  symmetric  nor  positive  definite.  We  will  see  the  implications 
of  these  properties  in  the  next  theorems. 

Theorem  6.2 — Tuned  Values  for  the  OE  Scheme:  For  u  =  1;  if 
|P(7uii  )|  f  0,  the  averaged  system  for  the  output  error  scheme  has 
a  unique  tuned  value; 

/c+.  \  _  /lmlP(jui,)llA/(ju^,)|Vt^W(j‘-’i))l^(7^>)n 

U-v  /  “  I  Im[P-(7a.-,)M(jui,)]/lm[3f{jra,)l|P(jW|)P  J 

^  (6.8) 

which  is  stable  if  Im(P(jar, ))  <  0.  The  tuned  value  is  such  that  the 
closed-loop  transfer  function  matches  the  reference  model  transfer 
function  at  wi. 

For  arbitrary  n:  the  averaged  system  for  the  output  error  scheme 
has  at  most  4:1  -  3  tuned  values  (aside  from  a  possible  tuned  value 
with  ci,  =  0). 

Comments:  The  proof  of  the  theorem  is  left  to  the  Appendix.  The 
results  are,  unfortunately,  much  more  limited  than  in  the  case  of  a 
feedforward  gain  alone.  Of  course,  we  should  not  expect  too  much, 
given  the  weak  assumptions  made  on  the  plant  transfer  function. 
However,  we  will  also  show  by  example  that  the  tuned  values  cannot 
be  bounded,  that  there  may  exist  one,  several,  or  none  at  all,  and  that 
they  may  all  be  unstable. 

In  the  case  of  a  single  sinusoid,  the  tuned  value  exists,  it  is  unique, 
and  it  is  the  value  that  achieves  model  matching.  This  is  consistent 
with  simulations  and  the  observations  of  Astrom  [2],  The  tuned  value 
becomes  unstable  when  Im[P(juii)]  >  0.  However,  this  happens  at 
the  same  time  as  when  the  tuned  value  corresponds  to  an  unstable 
inner  loop.  In  other  words,  inner-  and  outer-loop  instabilities  occur 
at  the  same  time.  This  is  different  from  the  Riedle  and  Kokotovic 
example,  where  inner-loop  stability  was  always  guaranteed. 

Theorem  6.3— Tuned  Values  for  the  IE  Scheme:  For  n  =  I.'  if 
lP(ju<i)|  #  0,  the  averaged  system  for  the  input  error  scheme  has  a 
unique  tuned  value  (aside  from  r.jf,.  =  0),  which  is  the  same  as  for 
the  output  error  scheme,  and  which  is  always  .stable. 

For  arbitrary  n:  there  exists  at  least  one  tuned  value  and  at  most 
4ii  —  3.  The  tuned  values  arc  bounded  by 

AE  \  <  ,  EE, I'Ll  _  (6.9) 

\rEv  /  (mill,  A,„;„{.4, )) 

Comments:  The  proof  of  the  theorem  is  left  to  the  Appendix.  The 
results  for  the  input  error  scheme  are  more  significant  than  for  the 
output  error  scheme.  We  arc  guaranteed  that  there  exists  at  least  one 
tuned  value,  and  that  all  the  tuned  values  are  bounded.  Still,  the 
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resulLs  are  less  significant  than  in  the  case  of  the  feedforward  gain 
alone,  except  when  there  is  a  single  sinusoid.  In  that  case,  the  results 
arc  similar  to  the  case  of  the  feedback  gain  alone,  guaranteeing  not 
only  the  uniqueness,  but  also  the  stability  of  the  tuned  value.  Again,  it 
should  be  remembered  that  this  guarantees  the  stability  of  the  adaptive 
system  only  if  the  tuned  value  corresponds  to  a  stable  inner  loop. 

The  main  generic  difference  between  the  OE  and  IE  schemes  is  that 
the  tuned  values  cannot  be  bounded  in  the  case  of  the  OE  scheme. 
This  tack  of  boundedness  can  be  traced  to  the  fact  that,  although  the 
matrices  Ai  in  (6.1)  are  summed  with  positive  weights,  the  resulting 
matrix  may  be  singular  in  the  OE  scheme.  On  the  other  hand,  in  the 
case  of  the  IE  scheme,  the  matrices  are  symmetric  positive  definite 
so  that  their  weighted  sum  will  always  be  positive  definite.  This  fact 
can  be  extended  to  adaptive  systems  with  the  number  of  parameters 
greater  than  two.  and  is  not  limited  to  the  two-parameter  case. 


VII.  Analysis  of  the  Multitone  Instabilities 

We  now  return  to  the  example  of  Section  II.  with  r  =  r,  sin(t)  + 
rj  sin(iot),  and  use  the  results  of  Section  VI.  We  observed  that  die 
adaptive  system  was  unstable  for  ri  =  1,  rj  =  0.3.  However,  the 
system  was  stable  for  r,  =  1,  =  0  and  r,  =  0.  =  1-  To 

gain  insight  into  this  phenomenon,  the  tuned  values  were  calculated 
for  various  values  of  ri,  r^.  Within  tJie  assumptions  of  averaging 
(sufficiently  small  adaptation  gain),  only  the  ratio  of  ri  over  r, 
is  important.  Tlicrefore,  we  let  ri  =  1  -  r^.  As  is  shown  m  die 
Appendix,  the  tuned  values  can  be  calculated  by  eliminating  ar^d 
solving  a  fifth-order  equation  for  dj,.  Fig.  3  shows  die  locus  of  <i.„ 
as  r2  is  varied  from  0  to  1  and  n  =  1  -  r2.  S  denotes  a  stable 
tuned  value  and  U  an  unstable  one.  The  cases  when  r2  =  0  and 
r2  =  1  correspond  to  single  tones,  for  which  we  confirm  that  a  unique 
and  stable  tuned  value  exists.  However,  in  the  intermediate  region, 
the  transition  is  far  from  being  smooth.  In  particular,  the  simulation 
of  Section  II  corresponds  to  a  case  where  a  unique  unstable  tuned 
value  exists,  which  explains  the  instability  of  the  original  adaptive 
system.  For  other  ratios,  there  may  exist  up  to  three  tuned  values 
(the  maximum  of  five  is  not  reached),  or  none  at  all.  Even  though 
the  absence  of  tuned  values  occurs  only  for  specific  values  of  rt 
(c.g.,  r2  =  0.16).  the  tuned  values  become  unbounded  as  the  ratio 
is  approached,  proof  of  the  fact  that  no  bound  can  be  found  for  the 
tuned  values  of  the  output  error  scheme. 

The  locus  for  the  input  error  scheme  is  shown  in  Fig.  4.  Again, 
the  cases  when  r2  =  0  or  rj  =  1  correspond  to  single  tones,  for 
which  unique  and  stable  tuned  values  are  known  to  exist,  identical  to 
those  of  the  output  error  scheme.  The  transition  between  the  two  is 
far  smoother  than  previously,  however,  and  the  tuned  values  always 
remain  bounded.  Further,  there  always  exists  a  tuned  value,  and  there 
also  always  exists  a  stable  tuned  value  (this  is  not  guaranteed  by 
Theorem  6.3). 

Fig.  4  also  gives  special  justification  to  the  name  of  tuned  value.  For 
a  large  range  of  amplitudes  of  the  additional  sinusoid  at  u;  =  10  rad/s, 
the  tuned  value  remains  close  to  its  original  value  for  a  single  tone 
at  w  =  1  rad/s.  The  adaptive  system  tunes  itself  to  the  low-frequency 
component.  When  the  component  at  w  =  10  rad/s  is  dominant,  the 
adaptive  system  tunes  itself  to  the  high  frequency  by  adjusting  its 
parameter  close  to  the  value  of  the  parameter  obtained  for  the  high 
frequency  alone. 

There  is  also  a  range  of  amplitudes  where  the  adaptive  system 
may  tune  itself  either  to  the  low-frequency  or  to  the  high-frequency 
component.  Figs.  5  and  6  show  the  steady-state  output  error  responses 
for  do(0)  =  -3  and  do(0)  =  -4,  respectively.  The  parameters 
r,  =0.24,  t2  =0.76,  and  g  =  10-  In  the  first  case,  the  adaptive 
parameter  do  converges  to  a  value  close  to  the  value  for  a  single 


Pig.  3.  Locus  of  tuned  values,  oulput  error  scheme. 


Fig.  4.  Locus  of  tuned  values,  input  error  scheme. 


tone  at  u)  =  1  rad/s,  and  the  residual  error  consists  mostly  of 
the  component  at  cu  =  10  rad/s.  In  the  second  case,  the  adaptive 
parameter  converges  to  a  value  close  to  the  value  for  a  single  tone  at 
ai  =  10  rad/s,  and  the  residual  error  consists  mostly  of  the  component 
at  w  =  1  rad/s.  The  adaptive  system  either  tunes  itself  to  the  low- 
frequency  or  to  the  high-frequency  componenL  depending  on  initial 
conditions.  In  both  cases,  the  residual  output  error  has  a  comparable 
magnitude.  Although  the  existence  of  a  unique  tuned  value  mig  t 
have  been  found  desirable  a  priori,  we  see  that  the  higher  number  is 
not  necessarily  contradictory  to  die  correct  behavior  of  the  adaptive 
scheme. 

Before  completing  this  discussion,  we  may  reflect  on  the  reasons 
why  the  two  model  reference  adaptive  control  schemes  lead  to  totally 
different  responses  to  multitone  inputs.  Aside  from  the  results  of 
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adaptive  system  with  unmodeled  dynamics  which  was  stable  for 
sinusoidal  reference  inputs,  but  became  unstable  for  the  sum  of  two 
such  signals.  Other  situations  were  found  where  several  regions  of 
attraction  of  the  adaptive  parameters  existed.  Their  number  depended 
on  the  location  of  the  spectral  components  and  on  their  respective 
amplitudes. 

The  technique  of  averaging  was  used  to  analyze  these  effects,  and 
demonstrated  its  power  for  the  study  of  the  dynamic  behavior  of 
adaptive  systems.  The  concept  of  tuned  value  as  an  equilibrium  point 
of  the  averaged  system  was  found  to  be  particularly  useful.  It  is  hard 
to  imagine  how  one  could  have  made  ^^nse  of  the  adaptive  system 
responses  to  various  multitone  inputs  without  the  peculiar  locus  of 
tuned  values  shown  in  Fig.  3.  Intuitive  explanations  of  the  multitonc 
instabilities  were  also  provided.  We  argued  that  the  minimization  of 
the  tracking  error  in  the  presence  of  unmodeled  dynamics  led  to 
conflicting  requirements  which,  in  turn,  resulted  either  in  tuning  or 
instabilities.  Tlie  effect  of  the  choice  of  error  equation  was  found  to 
be  particularly  important.  We  showed  tliat  the  input  error  scheme  was 
mere  robust,  and  this  claim  was  not  only  supported  by  simulations  on 
an  example,  but  also  by  an  analysis  that  did  not  depend  on  specific 
unmodelcd  dynamics. 

Tlic  observations  of  this  paper  are  not  contradictory  to  other 
research  results  in  robust  adaptive  control  emphasizing  update  law 
modifications.  Tlicsc  modifications  should  be  used  to  increase  robust¬ 
ness  and  prevent  parameter  drift  in  the  presence  of  noise  and  inputs 
that  are  not  sufficiently  rich.  Also,  tiic  effect  of  input  disturbances 
and  measurement  noise  was  not  addressed  in  this  work,  and  a  careful 
study  might  lead  to  interesting  conclusions. 


the  averaging  analysis,  wc  would  like  to  propose  the  following 
explanation.  In  Section  V,  we  reviewed  the  one-parameter  case. 
The  conclusion  was  that  the  scheme  was  stable  for  single  sinusoids 
provided  that  Re{POati))  >  0.  When  several  sinusoids  were  present, 
their  effect  was  cumulative,  i.e.,  the  subility  was  determined  by 
the  weighted  sum  of  Re[P(jw,)].  In  the  case  of  the  two-parameter 
example,  the  stability  in  the  presence  of  a  single  sinusoidal  input 
is  also  guaranteed  for  sufficiently  low  frequencies.  However,  the 
condition  is  that  Im[P(jwi)]  <  0.  Note  that,  in  the  example. 
Rc(P0'  LJ  )]  >  0  for  w  <  2.7  rad/s,  but  Im(P(jw)]  <  0  for  w  <  16.1 
rad/s.  Why  is  the  range  of  allowable  frequencies  larger  in  the  two- 
parameter  case  than  in  the  one-parameter  case?  It  turns  out  to  be 
because  the  feedback  gain  do  makes  the  real  part  of  the  closed- 
loop  transfer  function  positive  for  a  larger  range  of  frequencies. 
Indeed,  it  can  be  checked  that  Im[P(jivi)]  <  0  is  also  the  condition 
such  that  the  tuned  value  given  by  Theorem  6.2  makes  the  transfer 
function  from  variations  6co,  6do  to  variations  6eo  strictly  positive 
real. 

In  the  presence  of  multitone  inputs,  model  matching  can  only 
be  approximately  achieved.  If  the  low  frequency  component  is 
dominant,  the  adaptive  system  may  tune  itself  to  the  low  frequency. 
A  high-frequency  residual  error  will  then  be  present.  However, 
the  real  part  of  the  closed-loop  transfer  function  is  not  positive 
at  that  higher  frequency.  Simulations  and  calculations  have  told 
us  that  this  perturbation  can  easily  destabilize  the  output  error 
scheme.  On  the  other  hand,  the  input  error  scheme  does  not  require 
strictly  positive  real  conditions  for  stability  in  the  case  of  single- 
tone  inputs.  We  found  that  it  is  not  destabilized  by  the  multitone 
input. 


VIII.  Conclusions 

In  this  paper,  wc  observed  some  highly  nonlinear  phenomena  in 
adaptive  systems.  In  particular,  an  example  was  presented  of  an 


Appendix 

Proof  of  Theorem  6.1:  We  prove  the  result  for  the  output  error 
scheme  only.  The  proof  for  the  input  eiror  scheme  follows  along 
similar  lines.  Assuming  that  the  parameters  co  and  do  are  fixed,  the 
output  is  given  by  jtp  =  (coP/l  —  doP)(r],  and  therefore. 
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^[r]  -f  codo 


l-doP 

PM 


I  -  M(r 
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By  assumption  (A2),  the  average  of  (Al.l)  is  therefore  given  by 


E 


1 _ 

|l-doP0w.)P 


(co|P(jw.)|"  -  Re[P(jw.)Af'(jw.)] 
-t-rfolP(jw,)pRe[M(9w,)]). 


(A  1.2) 


The  average  of  (vp  —  y.„)r  is  similarly  obtained  and,  by  making  the 
appropriate  definitions,  the  results  of  the  theorem  follow. 

Proof  of  Theorem  6.2:  For  n  =  I;  assume  first  that  C(cav,  dav)  is 
nonsingular.  Under  the  conditions  of  the  theorem,  .4.1  is  nonsingular, 
so  that  the  tuned  value  is  unique  and  given  by  Aj  '6i.  It  can  also 
be  checked  tliat  the  tuned  value  solves  CavP[iwi]/l  —  davPliruii  = 
M{jwi),  i.e.,  that  the  tuned  value  corresponds  to  the  matching  of  the 
reference  model  by  the  closed-loop  transfer  function.  The  stability  of 
the  averaged  system  around  the  tuned  value  is  detennined  by  the 
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linearized  system  and.  therefore,  by  llte  stability  of  the  matrix 
2 

_  _  r,  Im[P(ii^i)] 

^  2  [m[AV(iuii)] 

/Rc(Af(i(xi01  (Re[A/_0w,)])"  -  (Im(M(ic^t)]f  \ 

‘iiM(juit)P  |M(M)rRe(ilV(iui)]  j 

(A2.1) 


Again,  it  is  easy  to  check  that  this  system  is  stable  under  the 
conditions  of  the  theorem.  Now,  let  us  return  to  the  assumption  that 
C(c»v,  d.v)  is  nonsingular.  C  is  singular  if  and  only  if  c,v  =  0. 
Assuming  that  a  tuned  value  with  cj,  =  0  exists,  it  would  have  to 
satisfy 

<i»v  |P(ju7i)r  Re(M0w,)]  -  Re(PO'.ui)Af(iw,)) 

—  dt^,  Rc[P(juii)M' 

+  Re(Af(jwi)l  =  0  (A2-2) 


and  tlicrefore  d+"|P0u;,)l"  -  2d+  Rc[P(ju;.)l  +  1=0.  However, 
tliis  equation  has  no  real  solution  if  Im[P(jtJi)l  ^  0. 

For  arbitrary  n,  the  equation  for  the  tuned  value  is  quite  more 
complicated.  If  c+  yf  0,  C(c.v,  is  nonsingular.  Take  i.  =  2  for 


ri  A2(d^v)  I 


<+*■  s 

u-av 

dl 

)-l, 

+  r2  A  I  (^av  )  ( 


0. 
Va2  Tt 


Tliis  is  a  set  of  two  polynomial  equations  in  c^,.,  d^s,-  The  maximum 
power  of  d+  is  3  and  of  c+  is  I.  The  coefficient  of  c+  is  a 
polynomial  in  d+  of  degree  2.  Therefore,  c+  can  be  eliminated 
to  yield  an  equation  of  degree  5  for  <f^„,  which  has  at  most  five  real 
solutions,  ci,  is  then  determined  uniquely  from  d^^.  The  procedure 
is  easily  extended  for  arbitrary  n,  indicating  that  there  are  at  most 
4ii  —  3  tuned  values. 

Proof  of  Theorem  6.3:  The  proof  is  similar  to  the  proof  of  Theorem 
6.2.  The  matrix  of  the  linearized  system  is  now 

2  J.2 

-pyC’(c.v,  =  -5ycL|P(ywi)|^ 

/  1  Re(M0u;i)l\ 

yRe(MOwi)]  lAf|^  / 

(A3.I) 


which  is  always  stable.  The  fact  that  the  matrices  A.  are  symmetric 
and  positive  definite  implies  the  additional  results.  In  particular,  the 
equation  for  dj,  is  again  a  polynomial  of  order  4n  —  3,  but  its  leading 
coefficient  is  guaranteed  to  be  nonzero.  Since  the  order  is  odd,  there 
must  exist  at  least  one  real  tuned  value.  The  bound  on  the  tuned 
values  also  follows  from  the  positive  definiteness  of  the  matrices  A.  . 
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The  Effect  of  Time  Delay  and  Discrete  Control  on  the 
Contact  Stability  of  Simple  Position  Controllers 

John  Fiala  and  Ronald  Lumia 


Abstract — By  analysis  of  the  driving-point  admittance,  it  Is  shown 
how  time  delays  and  discrete  control  can  create  instabilities  for  a  simple 
position  controller  in  contact  with  the  environmenL  The  lowest  frequency 
of  contact  instability  due  to  rime  delay  or  sampling  is  determined 
analytically.  It  is  shown  how  mechanical  compliance  between  the  motor 
and  point  of  conUct  can  eliminate  these  instabilities.  To  achieve  the  best 
relative  stability  when  contacting  arbitrary  environments,  the  mecham- 
cal/control  design  of  manipulators  should  maintain  a  critical  relationship 
between  the  frequency  of  the  compliant  mode  and  a  frequency  associated 
with  contact  instability. 


1.  Introduction 

The  simple  proportional-dcrivalive  (PD)  controllers  used  for  con¬ 
trolling  most  robots  show  a  remarkable  robustness  in  a  number  of 
tasks,  including  those  which  involve  contact  with  the  environment. 
Recentiy,  some  authors  have  noted  that  the  time  delays  and  sampling 
in  these  controllers  should  have  a  detrimental  effect  on  stability  during 
contact  with  certain  environments.  Goldenberg  and  Clark  [1 1  describe 
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Identification  with  Modelii^  Uncertainty 
and  Recon figu rafale  Control 

Marc  Bodson 


Department  of  Electrical  and 
Carnegie  Mellon  University, 

Abstract 

The  problem  of  obtaining  reliable  estimates  of  uncert^nty  in  >h 
parameters  identified  through  a  least-squares  algorithm  .s  - 
cussed.  Estimates  based  on  a  stochastic  analysis,  an  ana  yst 
assuming  bounded  noise,  and  a  sensitivity  an  ysis  are 
reviewed.  The  results  are  compared  and  illustrated  usmg  expe  - 
internal  data  obtained  on  a  DC  motor.  1110  need  f- 
estimation  of  uncertainty  is  justified  m  the  context  of  adapti  e 
control,  where  lobusmess  and  transient  perform^ce  are  cnucM^ 

In  particular,  the  application  to  reconfigurable  flight  ^ 

cor^idered.  Design  tradeoffs  for  flus  applicauon  - 
in  detail  and  iUustrated  tluough  simulations  usmg 
models. 

I.  Introduction 

There  has  been  a  considerable  interest  in  recent  years  m  new 
methods  of  identification  that  account  for  modelmg  uncert^nty 
n.  weU  as  measurement  noise.  Reasons  for  this  mterest  include^ 

-  traditional  methods  of  identification  do  not  provide  the  kind  of 
information  that  is  used  in  robust  control  system  design,  namely 
measures  of  nonparametric  dynamic  uncertainty;  . 

.  by  not  accounting  for  such  uncertainty,  tradiuonal  methods 
oflCT  provide  overly  optimistic  estimates  of  the  parametric 
uncertainty;  . 

•  stochastic  measures  of  parametne  uncertamty  are  probabihs- 
lic.  whereas  robustness  measures  assume  hard  bounds  on  trans¬ 
fer  function  models. 

Recent  papers  that  address  various  parts  of  this  problem  indude 
111-191-  The  adaptive  control  community  is  particularly  inter¬ 
ested  in  developing  robust  adaptive  controllers  using  die  sunple 
paradigm  that  robust  adaptive  control  equals  robust  idenufi<»- 
Ln  plus  robust  control.  Indeed,  there  is  evidence  that  the 
robustness  and  transient  performance  properties  of  existing 
algorithms  need  to  be  better  understood.  In  a  recent  paper  [10], 
we  found  drat  the  transient  response  of  certain  adapUve  algo¬ 
rithms  exhibited  repeated  large  peaks.  A  similar  phenomenon 
called  burst  phenomenon  is  known  to  occur  m  adapUve  systems 
with  measurement  noise  but,  in  this  case,  the  respo^es  were 
observed  in  systems  without  any  noise  or  unmodeled  dynamics^ 

In  fill  we  discussed  the  case  of  an  adapUve  system  which 
became’  unsuble  when  the  reference  input  was  composed  of  the 
sum  of  two  sinusoids,  even  though  it  was  stable  when  excited 
by  a  single  sinusoidal  component.  Such  examples  demonstrate 

Research  supported  by  the  Air  Force  Office  of  Scientific 
Research  under  grant  F49620-92-J-0386.  The  U.S.  gov¬ 
ernment  has  certain  rights  in  this  material. 
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the  complexity  of  the  issues  of  transient  performance  and 
robusmess  in  adaptive  systems,  and  justify  an  mterest  for  alter¬ 
nate  approaches. 

Traditional  adaptive  control  systems  are  based  on  the  cenainty 
equivalence  principle,  so  that  a  recursive  identifier  is  combmed 
with  a  control  algorithm  which  uses  the  estimated  parameters  as 
if  they  were  perfect  estimates.  New  adaptive  algorithms  would 
not  only  use  the  estimates  of  tlie  parameters,  but  also  ilie  esti¬ 
mated  uncertainty  affecting  tliese  parameters  (sec  Fig.  1). 


UNCERTAINTY 

Figure  1;  Traditional  and  New  Adaptive  Control  Structures 

By  being  more  cautious  when  necessary,  one  may  expect  that 
such  new  methods  will  exhibit  superior  transient  performance 
and  robusmess  properties. 

A  special  area  of  application  of  robust  adaptive  control  is  that  of 
rcco^gurable  control  which  we  define  as  the  control  of  multi- 
variable  systems  in  the  presence  of  drasuc  changes  m  ^amics 
due.  for  example,  to  acmator  failures.  In  such  cases,  the  s^c- 
tuie  of  the  controller  may  have  to  be  altered  (as  opposed  to  fine- 
tuning  of  the  parameters).  The  U.S.  Air  Force  has 
ested  in  reconfigurable  control  because  of  the  potenud  benefit 
for  increased  survivability  and  reduced  need  for  hardwire 
redundancy  [12],  [13].  In  flight  control  it  ^  difficult  to 
fault  tolerance  using  fault  detection  and  idenufication  b«aus 
of  the  large  number  of  possible  failures  and  the  large  "'^^ber 
flight  conditions.  Methods  based  on  automauc  system  iden  . 
cation  and  control  reconfiguration  are 

there  is  a  critical  need  for  an  adaptive  system  that  is  both  robus 
to  uncertainties  and  weU-behaved  in  the  transient  regime. 

2.  Identification  with  Modeling  Uncertainty 

Least-Squares  Algorithm;  Consider  the  Imear  regression 

r  •  N 
v(r)  =  w^(r)5  +^^(0 

where  y(t)  is  a  measured  signal,  w(t)  is  a  vector  of  measured 
signals.  d(t)  is  an  unknown  signal,  and  is  a  vector  of 
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unknown  parameters.  The  least-squares  algorithm  provides  the 
estimate  that  nunimtics  the  error 

/■=! 

and  is  given  by 

where 

n  ^ 

p  =T  R^='LwC‘)y(‘) 

"  <=1  '=t 

Assuming  that  d(/)  =  0.  we  have  that  0a; -6  as  long  as  the 
inverse  of  the  matrix  exists.  Otherwise,  we  first  define 


(3) 


(4) 


(5) 


(6) 


t-l 

?n<i  deduce  tliat 

e^  =  0'  +  R~J  R~j 

Stochastic  Analysis  of  Error;  Assuming  that  tlie  signal  d(t)  is 
a  stochastic  process,  the  estimate  0^  is  itself  a  random  variable 
whose  properties  can  be  predicted.  Specifically,  one  can  show 
[14]  that  if  w(r)  is  dctcnninistic  and  d(r)  is  a  zero  mean  while 
noise  process  with  variance  one  has  tliat 

E(6'u)='9‘ 

The  variance  of  the  noise.  can  itself  be  estimated  using 

where  N  is  the  number  of  data  points  and  n  is  the  dimension  of 
the  parameter  vector.  RA^tj)  «  the  residual  error  obtained  for 
the  least-squares  estimate.  It  follows  an  estimate  of  the  error  in 
the  Uh  component  of  the  parameter  0  is 


{ecs-S)i 


:(«rs) 


(9) 


where  (RZ'h  is  the  iih  diagonal  clement  of  R^  .  The  factor  of 
wo  assumes  that  2  standard  deviations  is  taken  to  be  a  rcason- 
ible  bound  for  the  error,  although  other  choices  arc  possible. 

Die  estimate  of  the  parametric  uncertainty  is  easy  to  calculate, 
fhe  matrices  R^.  R^.  andR,=  Z  y(f  f  can  be  computed  recur- 

f  =1 

sively  if  needed  and 

RAeLs)  =  R,-^LR~^i^ 

=  Ry-Rl,R''J 

The  only  question  concerns  the  assumption  tliat  w(r)  is  deter¬ 
ministic  and  d(t)  is  a  zero  mean  noise,  which  implies  that 
E(w(r)d(r))  =  0.  or  B^R^)  =  0-  This  means  that  the  noise 
sequence  is  uncorrelated  with  the  regressor  vector  w{r).  In 
practice,  the  signal  d(r)  will  be  composed  of  both  the  measure 
mem  noise,  and  a  term  due  to  unmodeled  effects  which  may  be 
highly  correlated  with  w(r).  Consider  for  example  the  process 


y(r)  =  vv(f)-r  £  'V^(I)  -I-  n(r) 


(12) 


where  n(r)  is  a  measurement  noise,  and  cw  (r)  is  a  small 
quadratic  nonlinearity  that  perturbs  the  linear  relationship.  If 
we  approximate  this  model  by 


y(r)=w(r)fl  +d(r) 


(13) 


no  matter  what  choice  of  0'  we  take,  the  signal  d(r)  will  be 
composed  of  both  the  noise  and  a  component  correlated  with 
w(f)  (e.g..  for  =  1,  d{t)  =  e  w\t)  +  n(t)). 

The  estimate  of  the  parameter  error  will  typically  go  to  zena  as 
time  goes  to  infinity.  Indeed,  assuming  that  a  large  number  of 
samples  is  taken,  the  matrix  (l/A.’)  R^  ihe  scalar  (IJN)  R^ 
will  both  tend  to  finite  limits,  so  that  the  estimated  error  will 
tend  to  zero  as  Af  — > ",  following  1  /  ViV.  This  is  a  correct 
result,  assuming  that  the  noise  is  uncorrelated  with  the  regressor 
vector  w(r),  so  that,  over  time,  the  effect  of  noise  averages  out 
to  zero  in  (6).  In  practice,  such  an  assumption  may  lead  to 
overly  optimistic  estimates  of  the  uncertainty  in  the  parameter 
estimates. 

Bounded  Noise  Analysis;  An  alternative  approach  consists  in 
assuming  tliat  the  noise  d(f)  is  bounded 


ld(f)l<B 


(14) 


but  is  otherwise  arbitrary.  Assuming  that  the  bound  is  known, 
each  mcasurcmeni  leads  to  a  constraint 

l^(r)-w^(f)Ol<B  (^5) 

which  can  be  viewed,  for  each  f,  as  a  subspace  of  the  parameter 
space  delimited  by  two  hypcrplancs.  The  estimate  0  now 
becomes  a  parameter  set  0.  which  consists  of  the  intersection 
of  the  subspaccs  obtained  for  all  r.  Keeping  track  of  the  precise 
parameter  set  is  too  complex  and  is  usually  replaced  by  an  over- 
bounding  eUipsoid  ([9]).  The  computations  are  more  extensive 
than  previously  and  the  bound  B  on  the  noise  must  be  kriown. 
Such  bound  may  not  be  easy  to  determine,  since  the  noise  is 
supposed  to  include  both  the  measurement  noise  and  the  contri¬ 
bution  of  the  modeling  uncertainty.  Outliers  may  also  be  more 
of  a  problem  for  such  algorithm,  since  they  could  lead  to  an 
empty  intersection  of  the  parameter  scL 

Am  alternative  but  similar  approach  consists  in  assuming  an  li 
bound  instead  of  an  /_  bound  on  d(t).  Note  that 


RABu)  =  Z  (”'’’(0  (»*  -  Bu)  +  dif) ) 

f»l 

=  {0LS -B'f  R^{eLs-B')-iiBi^~ o'f  R^  +  Rj  (16) 
where  Rj=J^  ‘^(0^-  Using  (6).  it  follows  that 


{0a:-6'f 

Now.  assume  that  a  bound  Rj_^y  is  known  such  that 

1  =  1 

Such  a  bound  can  be  viewed  as  an  bound,  or  as  an  L  bound  if 
we  take  ^  =  N.B'^,  with  1  d(r)  1<  fi.  Then. 

{0u -e’f  R^{0ls-B')^  -  RABis) 

and  a  simple  calculation  (c/.  [15])  shows  that  (19)  is  satisfied  if 

(20) 


{0ls-b');  ^  ),-.  (.Rd.r^x  -  RABls)) 
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which  is  a  new  estimate  of  the  error  in  the  ilh  parameter.  Note 
that  the  inequality  is  a  "hard"  bound,  as  opposed  to  the  stochas¬ 
tic  bound,  although  the  distinction  is  of  limited  practical  interest 
(c/.  [5]).  An  advantage  of  this  estimate  is  that  it  is  easy  to  com¬ 
pute.  On  the  other  hand,  it  requires  the  knowledge  of  „  ^  (ot 

B). 

Sensitivity  Analysis:  Another  view  of  the  problem  consists  in 
looking  at  the  sensitivity  of  residual  error  to  the  parameter  6. 
This  approach  was  used  to  analyze  data  from  nonlinear  models 
of  AC  motors  in  [15],  [16].  Specifically,  the  residual  error  for 
an  arbitrary  parameter  6  is  given  by 

R,(S)  =  0^  R^S-26^  R^  +  Rj  (21) 

The  minimum  value  of  R^  is  given  by  (lO)-(ll),  so  that  (21)  can 
be  revn'itten 

RA^)  =  B.(SLs)  +  (G-6isy  R^(.^-Ols)  (22) 

We  now  try  to  determine  how  sensitive  RA^)  variations 

6  -  e^.  By  definition.  /dO=0  around  llie  minimum,  so  that 
we  must  look  for  another  definition  of  sensitivity.  For  example, 
we  may  look  at  variations  that  lead  to  a  doubling  of  tlic  residual 
error  (i.e..  that  would  cause  an  increase  in  the  residual  error 
equal  to  what  is  obtained  at  the  minimum).  We  have  that 


RA^')  ^  2  RA^ls) 

RAO- eLs}^BA0u) 

(23) 

A  sufficient  condition  for  (23)  is 

(8-ecs)i  ^  '^<iR-JhBAOLs) 

(24) 

Note  that  this  is  not  an  estimate  of  the  error  -  0  but  a  figure 
of  the  sensitivity  of  the  error  with  respect  to  the  estimate.  How¬ 
ever,  the  result  is  remarkably  similar  to  the  previous  estimates 
of  cnors.  The  main  difference  with  the  stochastic  estimate  is 
that  the  factor  (A/  -  n)  is  not  present.  Its  absence  represents  the 
fact  that  no  assumption  is  made  about  the  averaging,  or  "benev¬ 
olence",  of  the  noise.  Compared  to  the  bounded  noise  analysis, 
the  sensitivity  analysis  docs  not  require  the  a  priori  knowledge 
of  the  noise  bound  (although  it  may  be  argued  that  it  implicitly 
assumes  that  Rj^-2  R^  (^rs))-  factor  of  two  in  the  anal¬ 
ysis  is  arbitrary  and  other  choices  such  as  1. 1  (10%  increase 
instead  of  100%),  or  others,  would  be  valid.  The  results  would 
just  be  scaled  by  a  constant  factor. 

Example:  A  DC  Motor.  Consider  the  example  of  the  electrical 
equation  describing  a  DC  motor 

L  —  =  v—Ri  —  Kj(i}  (25) 

dt 

L  is  the  inductance  (//),  R  is  the  resistance  (fl).  and  the 
torque  constant  (A/.  m/A).  The  signals  are:  v  the  voltage  (K).  i 
the  current  (A),  and  a)  the  speed  (radls).  Two  experiments  were 
performed  where  the  voltage  v  was  varied  in  6  equal  steps,  from 
0  to  15  F,  upwards  in  the  first  case  (acceleration)  and  down¬ 
wards  in  the  second  (deceleration).  The  current  and  the  angular 
position  were  measured  and  filtered  using  Butterworth  filters  of 
order  3  to  reduce  the  pulse  width  modulation  noise  and  the 
quantization  noise.  The  derivatives  of  the  current  and  of  the 
position  were  calculated  using  backwards  differentiation.  A 
least-squares  algorithm  was  used  to  identify  L,  R,  and  Kj-, 
assuming  (25). 


The  speed  response  ci(r)  is  shown  on  Fig.  2  and  llic  current  i(r) 
on  Fig.  3  for  the  acceleration  experiment.  The  responses  for  the 
deceleration  experiment  are  omitted  for  brevity. 


Figure  2:  Speed  (rad/s)  as  a  function  of  time  (s) 


Figure  3:  Current  (A)  as  a  function  of  time  (s) 

Fig.  4  shows  the  estimates  of  Kj-,  calculated  every  100  points  or 
50  ms. 


Figure  4:  Torque  constant  (N.m/A)  estimate 
Stochastic  estimates  of  uncertainty 

The  2a  bounds  (9)  provided  by  the  stochastic  analysis  are  also 
shovTi.  The  set  of  responses  on  the  top  were  obtained  witli  tiic 
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acceleration  experiment,  and  the  bottom  ones  with  the  decelera¬ 
tion.  As  one  may  note,  the  stochastic  estimates  arc  quite  opH- 
!iuslic  Given  that  the  signals  arc  fairly  clean  (see  Figs.  2  and 
3)  one  may  suspect  that  the  residual  error  is  largely  composed 
of  modeling  error  and  not  much  of  measurement  noise. 

Cm  5  shows  similar  curves  for  the  sensitivity-based  estimates 
(24).  which  are  more  conservauve,  but  more  accurately  reflect 
the  uncertainty  in  this  set  of  experiments. 


3.  Reconfigurable  Flight  Control  Systems 

leconfigurablc  flight  control  is  a  fascinating  application  of 
rdapdve  control  and  of  methods  of  identification  accounung  for 
nodding  uncertainty.  In  flight  control  linearized  aiiCTaft  mod- 
:1s  arc  typically  assumed,  so  that  one  relies  on  a  standard  state- 

rpacc  model 


X=Ax+Bu 


(26) 


vherc  the  state  variables  are  separated  into  longitudinal  van- 
blcs;  c  (angle  of  attack),  v  rate),  h  (altitude),  v  (vcloc- 

cy)  and  the  lateral  variables:  P  (sideslip),  p  (roU  rate),  r  (yaw 
ate)  ^  (toU  angle),  yr  (yaw  angle).  11.=  control  inputs  are  also 
eparated  into  the  longitudinal  control  vanable.  >’ 

md  the  lateral  control  variables:  (ailerons)  and  (rudder). 

\fter  a  failure,  a  similar  linearized  model  is  valid 


x  —  A x+ B  u  +  d 


(27) 


ut  a  disturbance  d  is  added  to  account  for  the  fact  that  the  ortg- 
,al  equilibrium  is  usually  not  an  equiUbrium  of  the  failed  air- 
raft  A  new  setpoint  (trim)  must  be  found.  Also,  the  longitudi- 
al  and  lateral  d>-namics  may  not  be  decoupled  anymore,  so  *at 
re  A.  B  matrices  will  not  be  block-diagonal  as  for  the  unfailed 

ircrafL 

lome  challenges  for  this  problem  arc  that: 
it  is  a  multivariable  problem,  with  strong  couplings  usually 

ippearing  after  failures;  _ 

it  is  a  nonlinear  problem,  which  means  that  tnm  values  w. 
■hange  after  failures,  but  also  that  the  linearized  model  wiU 
:hangc  with  operating  conditions,  requiring  the  continuous  use 
3f  a  nonlinear  or  adaptive  algorithm  after  the  failure; 

.  the  system  may  be  highly  unstable,  leaving  very  little  time  for 


reconfiguration; 

-  actuator  authority  is  very  limited  and  sensor  noise  is  signifa- 
cant  (high-gain  feedback  is  not  an  option). 

The  use  of  a  least-squares  algorithm  to  identify  the  unknown  A, 
B  matrices  and  d  vector  is  attractive  and  feasible.  Such  algo¬ 
rithm  can  very  quickly  lead  to  accurate  esumates  (see 
[17]-[19])  There  is  a  need,  however,  for  reliable  uncertainty 
estimates  that  account  for  modeling  errors  and  for  methods  to 
perform  the  control  reconfiguration  in  a  robust  manner.  We  dis¬ 
cuss  some  of  the  design  trade^iffs  that  may  be  considered. 


Design  TVade-Offs 

Choice  of  Inputs:  The  elevator  command  is  typically  a  sym¬ 
metric  command  =  In  the  presence  of  failures,  it 

may  make  sense  to  separate  the  single  longitudmal  command 
into  two  separate  commands.  The  advantage  is  that  it  wiU  then 
be  possible  to  exploit  all  tlie  available  degrees  of  freedom.  The 
disadvantage  is  that  more  parameters  will  need  to  be  identified, 
requiring  more  computations  and  more  stringent  conditions  for 


parameter  convergence. 


Choice  of  Stales:.  The  so-called  short  period  approximation 
(involving  or  and  <7  only)  and  the  dutch-roll  approximation 
(involving  p,  p.  and  r  only)  arc  reduced-order  models  that 
accurately  reflect  aircraft  dynamics  over  shon  periods  of  time. 
Their  advantage  is  lliat  tlicy  require  less  parameters  and  less 
excitation  for  parameter  convergence.  On  the  other  hand, 
neglected  dynamics  arc  added  to  the  overall  dynamic  uncer¬ 
tainty  in  the  model. 

Choice  of  Tracked  Outputs:  Tlic  choice  of  output  variables  is 
not  unique,  and  a  good  choice  may  depend  on  speed  (pitch  rate 
at  low  speed,  vertical  acceleration  at  high  spe^s).  Certain  out¬ 
puts  lead  to  nonminimum  phase  zeros,  imposing  co^traints  on 
tracking  performance  and  the  choice  of  control  algorithms. 
Choice  of  Measured  Outputs:  The  availability  of  angle  of  attack 
and  sideslip  measurements  leads  to  knowledge  of  the  fiiU  state 
of  the  system  and  considerable  simplification  of  the  identifica¬ 
tion  and  control  algorithms.  However,  such  measurements  arc 
much  less  reliable  and  more  prone  to  damage  than  mertial  mea¬ 
surements  [20]. 

Choke  of  Control  Objectives:  Standard  flight  handling  qualities 
(cf.  [21])  arc  useful  to  define  objectives  of  the  reconfigured  con¬ 
trol  system.  However,  the  failed  aircraft  is  often  siptificantly 
less  symmetric  (and  therefore  decoupled)  than  the  ongmal  one. 
In  addition  to  stabilization  and  tracking,  one  must  consider  tnm 
recovery  and  decoupling.  The  control  objectives  may  also  have 
to  be  adjusted  according  to  the  damage  meurred  to  the  aircrafL 


Choke  of  Control  Law:  The  automatic  redesign  of  a  control  law 
is  a  major  challenge  for  reconfigurable  control.  Makmg  sure 
that  such  redesign  will  be  successful  m  all  cases  and  mco^orat- 
ing  measures  of  unceruinty  in  the  design  are  difficult  problems. 
In  addition,  a  tricky  issue  is  the  impact  of  the  controller  on  flte 
identification  accuracy.  For  example,  if  one  attempts  to  identify 
separately  the  effects  of  fire  two  elevators  and  5,^  tlie 

commanded  values  cannot  be  equal  or  the  elements  of  the  B 
matrix  will  not  be  identifiable.  In  other  words,  the  control  law 
for  the  unfailed  aircraft  cannot  be  used.  In  some  cases,  it  may 
also  be  necessary  to  add  excitation  to  obtam  sufficient  accuracy. 
The  best  way  to  balance  additional  excitation  and  control  per¬ 
formance  is  unfoitunately  an'incompletely  resolved  issue. 
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Example  1:  F-I6  Model.  We  first  consider  simulations  of  a 
fighter  aircraft  using  a  .model  developed  at  Wright  Patterson  Air 
Force  Base  (see  [17],  [18]).  The  model  is  a  linearized  model  of 
the  longimdinal  dynamics,  with  fairly  detailed  sensor  noise 
models.  Fig.  7  shows  the  estimated  values  of  the  parameter 
This  parameter  is  the  clement  of  the  B  matrix  indicating  the 
effectiveness  of  the  aileron  in  producing  a  pitching  moment 
An  unfailed  aircraft  is  assumed,  and  the  parameters  of  the 
model  are  estimated  using  a  least-squares  algorithm.  The  solid 

line  shows  the  parameter  estimate  using  the  full-order  model, 
while  the  dashed  line  is  obtained  with  the  reduced-order  model. 
Surprisingly,  the  estimate  using  the  reduced-order  moJ  J  is  bet¬ 
ter,  despite  the  additional  unmodcled  dynamics.  The  condition 
numbers  in  the  reduced-order  case  were  found  to  be  up  to  two 
orders  of  magm'tude  smaller  than  in  the  full-order  case,  indicat¬ 
ing  difficulties  with  excitation  in  the  full -order  case  (ways  to 
address  this  problem  are  discussed  in  [18]). 


Figure  7;  Af^  estimate 
Stochastic  estimates  of  uncertainty 


^c  stochastic  and  sensitivity  estimates  of  uncertainty  are 
s  own  m  Figs.  8  and  9  respectively,  for  the  full  order  model, 
urves  or  the  reduced-order  model  are  qualitatively  similar, 
previously,  the  stochastic  measures  are  somewhat  optimistic 
while  the  sensitivity-based  measures  are  generally  conservauve. 

E^rnple  2:  AIAA  Model.  Experiments  involving  failures  were 
performed  usmg  the  model  developed  for  the  AIAA  design 


challenge  [22].  This  is  a  detailed  nonlinear  simulation  of  a 
twm-engine  fighter  aircraft.  Wc  give  the  results  here  to  illus¬ 
trate  the  potential  of  least-squares  identification  methods  for 
rcconfigurablc  control.  The  reduced-order  model  of  the 
unfailcd  aircraft  was  determined  through  a  least-squares  to  be 


^  0.05  0.00  0.02h  r  o.OOh 

-6.74  -0.06  0.05  (^e\  0.19 

+  0.13  -  0.12  0.09  .  -r  0.05 

0.93  7.87  -  0.78  U^J  -0.37 

l-O.Ol  0.10  -2.64  J  1-0.27, 

The  units  for  a.  (},  5^.  and  are  degrees,  and  for  p.  q  and 
r  are  degrees  per  second.  After  a  failure  of  one  of  the  elevators, 
the  failed  aircraft  was  identified  to  be 


^  0.00  0.02  O.OOh  f  0.04y 

-3.47  -0.07  0.14  -1-36 

+  0. 13  -0. 14  0.12  .  +  0. 10 

-7.44  8.12  -0.81  3.27 

1-1-13  0.12  -2.65  j  0.19, 

It  was  found  that  it  was  reasonable  to  assume  that  the  A  matrix 
remamed  block-diagonal  after  the  failure,  and  tliis  was  enforced 
in  the  least-squares.  On  the  other  hand,  strong  couplings 
appeared  m  the  ft  matrix,  and  the  constant  term  became 
nonzero.  The  failure  led  to  a  reduction  of  the  effectiveness  of 
the  elevators  (by  50%.  from  -6.74  to  -3.47).  a  rolling  moment 
(-7.44)  and  a  yawing  moment  (-1.13)  appearing  with  the  eleva¬ 
tor  command,  as  well  as  a  constant  pitching  moment  (-1.36)  and 
a  constant  rolling  moment  (3.27).  From  these  numbers,  one  can 
deduce  tliat  an  elevator  is  blocked  at  a  position  of  approxi- 
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matcly  1/3  (3.47/6.74)  of  a  degree  from  the  previous  irrm  jmsi- 
tion  VVhich  elevator  is  blocked  can  be  deduced  from  the  signs 
of  the  elements.  However,  a  more  interesting  quesuon  for 
research  is  how  the  estimated  parameters  with  the  uncertainties 
can  be  used  for  the  automatic  redesign  of  a  muluvanab  e  con¬ 
trol  law. 

4.  Conclusions 

Obtaining  reliable  estimates  of  uncertainty  in  the  idraufic^on 
of  dynamic  models  is  an  important  step  towards  buil^g  adap- 
dve  systems  with  good  robusmess  and  transient  perforrnmee 
oropcrtics.  In  this  paper,  we  discussed  the  qualitative  differ- 
between  several  methods  to  measure  parametric  tmeer- 
Snty  in  the  presence  of  model  error.  We  discussed  the  adv^^ 
Uges  of  a  sensitivity-based  analysis  of  the  error  and  lUustrated 
J  results  with  experimental  dau.  A  chaUengmg  and  «citmE 
area  of  application  is  reconfigurable  flight  control.  We  d^- 
cussed  the  richness  of  this  problem  through  a  hst  of  the  wide 
choices  to  be  made,  and  illustrated  the  appheauon  of  least- 
squares  mctliods  using  two  aircraft  models. 
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Abstract 

A  multivariable  model  reference  adaptive  control  algo¬ 
rithm  is  presented  for  the  case  when  the  high-frequency 
gain  matrix  is  unknown.  Only  an  upper  bound  on  the 
norm  of  the  matrix  needs  to  be  known  a  priori.  A  trans¬ 
formation  of  the  parameters,  with  a  sort  of  hysteresis,  is 
used  to  guarantee  that  a  controller  matrix,  which  is  nor¬ 
mally  the  inverse  of  the  estimate  of  the  high-frequency 
gain  matrix,  remains  nonsingular.  It  is  shown  that  all  the 
signals  in  the  adaptive  system  arc  bounded  and  that  the 
tracking  error  and  the  regressor  error  converge  to  zero  for 
all  bounded  reference  inputs.  Furthermore,  exponential 
convergence  is  achieved  when  the  regressor  vector  is  per¬ 
sistently  exciting. 


system  remain  bounded,  that  the  output  error  converges 
to  zero,  and  that  the  regressor  error  is  in  L2  and  converges 
to  zero,  independently  of  the  richness  of  the  signals  used 
as  reference  inputs.  We  also  prove  that  exponential  con¬ 
vergence  is  achieved  when  persistency  of  excitation  con¬ 
ditions  are  met.  Aside  from  the  nontrivial  multivariable 
extensions,  an  original  contribution  of  our  paper  is  also 
the  exponential  convergence  result  (only  asymptotic  sta¬ 
bility  is  achieved  by  Lozano  et  at  (1990)  algorithm). 

Finally,  our  success  in  applying  the  hysteresis  transfor¬ 
mation  also  suggests  that  such  transformation  may  prove 
helpful  to  solve  other  problems  where  singularity  regions 
must  be  avoided,  such  as  in  adaptive  pole  placement,  and 
in  nonlinear  control  using  linearization  techniques. 

1  Definitions  and  facts 


Introduction 

Single-input  single-output  model  reference  adaptive  con¬ 
trol  (MRAC)  results  have  been  extended  to  continuous¬ 
time  multivariable  systems  by  several  authors,  e.g.,  El¬ 
liot  Ic  Wolovich  (1982),  Singh  L.  Narendra  (1984),  and 
Tao  &  loannou  (1988).  Unfortunately,  current  MIMO 
algorithms  require  significant  a  priori  knowledge  or  con¬ 
straints  on  the  high-frequency  gain  matrix.  Either  the 
high-frequency  gain  matrix,  Kp,  must  be  known  (fully  or 
partially,  e.g.,  Kp  diagon^J  with  the  signs  of  the  diagonal 
elements  known),  or  it  must  satisfy  some  positive  definite¬ 
ness  condition  {e.g.,  there  exists  a  known  matrix  S  such 
that  SKp  >  0). 

In  the  SISO  case,  the  problem  of  relaxing  the  require¬ 
ment  of  knowledge  of  the  sign  of  the  high-frequency  gain 
was  first  solved  by  using  controllers  based  on  the  so-called 
Nussbaum  gain,  cf.  Mudgett  &  Morse  (1985).  Because  of 
the  limited  practical  use  of  these  controllers,  Lozano  et  at 
(1990)  proposed  a  completely  different  approach.  In  their 
algorithm,  the  controller  parameters  are  obtained  from 
the  estimated  parameters  by  applying  a  transformation 
with  a  sort  of  hysteresis. 

In  the  MIMO  case,  the  problem  of  unknown  high- 
frequency  gain  matrix  is  even  more  difficult.  The  Nuss¬ 
baum  gain  approach  is  not  directly  applicable  to  MIMO 
MRAC  algorithms.  However,  using  the  hysteresis  idea  of 
Lozano  et  at  (1990)  with  some  significant  modifications, 
we  show  in  this  paper  how  to  design  a  MIMO  MRAC 
algorithm  so  that  stability  is  guaranteed  even  if  the  high- 
frequency  gain  matrix  is  unknown  (only  an  upper  bound 
on  the  norm  of  the  high-frequency  gain  matrix  and  an 
upper  bound  on  the  norm  of  the  matrix  of  unknown  pa¬ 
rameters  are  needed).  We  show  that  all  the  signals  in  the 
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Definition  1  ;  Hcrmite  normal  form 
For  any  square,  strictly  proper,  and  nonsingular  plant 
P{s)  €  5i’’^’’{s)  (the  set  of  {p  X  p)  matrices  whose  el¬ 
ements  are  rational  functionsof  a),  there  exists  a  unique 
matrix  H{s)  €  5R'”‘’’(s),  called  the  Hermite  normal  form, 
such  that: 

P{s)  =  His)Uis) 

17(s)  6  S?'”‘'’(s)  and  lim  U{s)  =  Kp  nonsingular 

«  — OO 


where  dh;j{s)  <  r,-  —  1  and  a  is  arbitrary,  but  fixed  a 
priori. 

Deftnition  2  :  Projections 

Following  Stewart  ic  Sun  (1990),  Chapter  I,  let  T  be  a 
subspace  of  SJ"  of  dimension  r  <  n  and  let  the  columns  of 
the  orthogonal  matrix  Qa-  €  form  an  orthonormal 

basis  for  X,  with  Q'^Qx  =  I-  The  matrix 

Px  =  QxQ^ 

defines  the  orthogonal  projection  onto  X,  i.e.  V  z  £  5i", 
Pxz  €  X  and  (r  —  Px^)  -L  X.  It  can  be  easily  verified 
that  the  matrix  Px  €  3?”’^”  is  symmetric,  idempotent, 
and  independent  of  the  choice  of  Qx-  The  matrix 

Pi  =I-Px 

defines  the  orthogonal  projection  onto  X^,  the  subspace 
of  5?"  of  dimension  n  —  r,  that  is  the  orthogonal  com¬ 
plement  of  X.  The  matrix  Pi  £  is  also  symmetric, 

idempotent,  and  independent  of  the  specific  choice  of  Qx- 
Let  Vc(A)  be  the  range  of  A,  i.e.  the  subspace  of  5?" 
spanned  by  the  columns  of  the  matrix  A.  Then,  tliere 
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exists  a  square  orthogonal  matrix  U  such  that 
B  0  ]  with  B  full  rank  and  one  has  that 


Deanition  3  .-  Singular  value  decomposition 

,  .  g>-x"  have  rank  r.  then  the  singular  value  dc- 
of  A  is  given  by  (c/.  Stewart  ^  Sun  (1990). 

Chapter  I) 

U^j^V  =  E  U^U  =  I  = 

„  ^  5pmxmin(n..n)  y  g  5>nXmin(m.n) 
where  the  matrices  If  €  K 
E  =  diag{o-.}  with 


A^u^^'n^than  D(s)  €  (^1- 

^  -1  matrices)  ar; D  denotes  the  maximum  polynomial 

in  the  i-th  row  of  D(s).  dc.D  the  maximum  poly- 
®  lleree  in  the  i-th  column  of  Zl(s).  and  80  the 
m^mum  polynomial  degree  in  D(s).  In  the  paper.  P(s) 
denotes  the  transfer  function  of  a  linear  time  invariant  op¬ 
erator  While  PH  denotes  the  output  of  the  operator  (in 
Te  time  domain)  with  input  u(t).  Finally  if  x  is  a  vector, 
possibly  function  of  time,  we  denote  lx(t)|,  the  Euchdean 
norm  of  x  at  time  i- 

2  MRAC 

2.1  Fixed  controller 

We  use  a  standard  controller  structure  (see.  e  g..  ElUot 
fe  Wolovich,  (1982)).  Assume  that  the  plant  is  dwen 
by  a  square,  no.nsingular.  strictly  PfoP«.  minimum 
phase  transfer  function  matrix  P(s)  €  5?  (s).  assume 

that  the  Hermite  normal  form  of  P(s)  (see  Definition  1)  is 
known,  assume  that  an  upper  bound,  v.  on  the  maximum 
of  the  observability  indices.  is  known.  The  equation 

of  the  plant  is  =  PH  and  the  following  controller 

structure  is  considered 

r  =  Afo[ro] 

u  =  Cor  +  A~'CH  +  ^”‘^ts'pl 

where  ro  €  »'’  is  a  reference  signd  Co  € 
nonsingular.  A(s),C(s)  €  ^  and 

Afo(s)  G  K'”‘'’(s).  A^o(5)  is  a  proper  stable  transfer  func¬ 
tion  rnatrix  and  A(s)  is  a  diagonal  matrix  of  Hurwitz  poly¬ 
nomials.  A(s)  =  diag{A(s)} .  with  aA(s)  -  u  -  1. 

By  combining  {Nh.Dr}.  a  right  matrix  (r^  ion  d^ 
scription  (MFD)  of  F(s)  (P(s)  ==  Nn{s)D,Hs)),  with 
the  equation  of  the  controller  (1).  the  output  yp  can  be 

expressed  as 

_  //^((A-C)Dii- DAR)"‘ACo[r]  (2) 

Now.  let  the  reference  model  M(s)  = 

H(s)  is  the  Hermite  normal  form  of  P{s).  The  mode 
output  y„  =  HMo[ro]  =  H[r].  U  ^n  be  sWn  (see  e.y 
Sastry  Bodson  (1989))  that  3Co*  €  C  (s).  D  (s) 

e  solution  of  the  Diophantine  equation. 

Nr[{A-C')Dr  -  D'Ar]-’ACo  =  H  (3) 


so  that  model  matching  is  achlev^,  with  A  ‘P  P^^°P«. 
and  A“‘C  strictly  proper.  In  particular,  Co  —  Kp  non- 
singular.  aD*  <  and  3r.-c*  <  SA;.  It  c^  also  be 

shown  (c/.  de  Mathelin  ic  Bodson  (1992)).  tha^t  the  matri¬ 
ces  Co  c*  D'  are  unique  If  we  assume  knowledge  of  the 
observlbUity  indices  {v.)and  add  the  following  constraint 

dciD’  <  V.  -  1  Vi  (4) 

2.2  Adaptive  controller 

la  order  to  implement  adaptation  in  the  model  reference 
design,  we  must  estimate  Co*.C*.I>*.  which  satisfy  the 
matching  equality.  The  matching  equality  (3)  is  equiva¬ 
lent  to 

I  =  Co*P“'P  + A”*C*  +  A“'D*P  (5) 

Define  L^s)  =  diag(I(s)}.  with  i(s)  Hnrwitz.  3l(s)^^  d 
where  d  is  the  maximum  degree  of  all  elements  of  E  (s). 
Then  multiplying  both  sides  of  (5)  by  L  '  and  applying 
both  transfer  function  matrices  to  u  leads  to 

£->[„]  =  Co*(i/I)~*[yp]  +  L-\A-'C'[u]  +  A-‘D*[yp]) 

(6) 

Since  H(s)  is  known  a  priori,  (6)  is  an  equation  where 
the  unknown  parameters  appear  linearly  which  can  be 
rewritten  as 

Z,-’H  =  CoiHL)-'[yp]  -I-  (^) 

where  xl>  is  the  regressor  vector  and  6"  is  the  matnx  of 
unknown  controller  parameters  whose  p  first  colurnns  are 
equal  to  Cq.  The  following  error  equation  can  be  derived 

=  =  =  (8) 

where  9  is  the  estimate  of  3*  and  ^  is  the  parameter  er¬ 
ror  We  will  assume  that  the  foUowing  normalized  least- 
squares  identification  algorithm  with  covariance  resetting 
is  used: 

1  -  a  -  _  P'l>^2_  y,  7  >  0 

4>  -  e  - 

P  ^  with  P(0)  =  P(H  =  >  0(9) 

1  + 

where  e.  b  the  identifier  error  (8)  and  (r.)  are  the  reset¬ 
ting  time  instants  such  that 

(p,}  =  {rdA„.*(P(r-r))  =  *^i  0<fci<to} 

where  A„„  denotes  the  largest  eigenvdue.  T^^nx  F 
is  discontinuous  at  the  resetting  instants  r^,  with  the  brm 
on  the  right  P{n)  equal  to  a  predeterinined  matrix  ho/ - 
If  Jb,  =  0  the  standard  least-squares  algorithm  without 
resetting  is  obtained.  For  hi  >  0.  the  algorithm  will  reset 
whenever  the  F  matrix  becomes  too  small. 

Deftnition  4  :  Persistent  Excitation 
A  bounded  vector  if  is  persistently  exciting  (PE)  iff  3  a  > 
0,  3  S  >  0  such  that 


/•'»+«  r 

/ 

J  <a 


dr  >  O'!  V  fo  ^  0 
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Definitioa  5  :  Sufficieat  Excitation 
A  bounded  vector  ^  is  sufficiently  exciting  (SE)  iff  3  a  >  0 
such  that 


V  fo  >  0  3  S(to)  >  0 


such  that 


dr  >  oti 


Note  that  if  a  signal  is  PE  then  it  is  also  SE.  but  the 

converse  is  not  true. 

Lemma  1  :  Properties  of  the  estimatloa  algorithm 


and 


Assuming  that  V-  €  the  estimation  algorithm  (9)  i.as 

the  following  properties 

1.  0  <  P  <  to/,  tj"'/  <  G  ioot, 

2.  (rjc+i  —  rje)  is  bounded  below  V  t. 

3  S  - - ^ - T-  P  ^  ;-^7 -  G  Poo- 


4. 


t77.  d,  P  G  /'2- 


s  If _ - r-  is  not  SE  or  if  hi  =  0  (no  resettings); 

•  ^  finite  set. 


P.  r  —  ,  ,  G  /-a 


i  +  Uv- 


C.  If 


•  P  G  /i  • 

•  P(t)  and  d(i)  converge  to  some  Poo  and  doo- 

- tr-Tuj  is  SE: 

(l+vV'^4')'' ^ 

•  lim<—oo  i^Ct)  =  0- 


3-  eL^nL^.  where  <5^  =  flo  -  r . 

4.  If  Um,_oo  d(.t)  =  0  then  lim.-oo  dc{t)  =  0. 

5.  If  P(t)  and  <5(t)  converge  to  some  P^  and  doo  then 
dc(t)  converges  to  some  dcoo- 

Since  the  transformation  that  we  propose  to  achieve  this 
objective  is  complex,  we  proceed  to  present  the  transfor¬ 
mation  in  several  steps. 

3.1  Simple  hysteresis 

Define  Co^(0  as  being  equal  to  Co{0  when  ^fiun(^o)  ts 
sufficiently  large,  and  staying  constant  when  <rnun(Co) 
becomes  too  small  (o-min  denotes  the  smallest  singular 
value).  More  precisely,  since  an  upper  bound  on  \ICp\  is 
known,  a  lower  bound,  tr,  on  ermuCCo  ),  ia  also  known.  Let 
n™.. (G)(0))  >  cr,  and  define  Coy  such  that  the  relation¬ 
ship  between  <r„Un(Go^  )  and  ^aua(Go)  can  be  described  by 
Fig.  1.  Define  {<t}  as  the  time  instants  when  the  value  of 


7.  If 


is  PE  and  ti  >  0: 


•  lim«-oo  ^(0  =  0  exponentially. 


The  proof  is  in  de  Mathelin  (1993). 

Comments: 

The  specific  adaptation  algorithm  is  chosen  for  its  spe¬ 
cial  properties  given  in  Lemma  1.  Most  important  is  the 
fact  that  the  parameter  error  d  converges  (although  not 
necessarily  to  zero)  independent  of  richness  properties  of 

d- 


3  Hysteresis  transformation 

Although  it  might  not  be  obvious  a  priori,  the  develop¬ 
ment  of  a  stability  proof  for  the  MRAC  algorithm  reveals 
that  the  parameter  Go  in  the  control  law  (1)  must  be  non- 
singular.  More  precisely,  the  matrix  Cg  ‘  must  remain 
bounded.  Since  this  property  is  not  guaranteed  by  the 
parameter  identification  algorithm  (9),  the  parameter  Go 
used  in  the  control  law  will  be  dissociated  from  the  esti¬ 
mate  Co.  Hereafter,  we  will  use  the  subscript,  c,  to  mark 
the  controller  parameters,  as  opposed  to  the  estimated  pa¬ 
rameters.  So,  instead  of  Co,,  being  equal  to  the  estimate 
Co,  a  transformation  is  applied  to  make  sure  that  Co^  bas 
a  bounded  inverse  even  if  Co  is  close  to  singularity. 

To  complete  the  proof  of  stability  and  convergence  of 
the  MRAC  algorithm,  one  finds  that  the  following  prop¬ 
erties  are  required  from  the  parameter  transformation. 

1.  Cg-J  €  Loo. 

2.  fie  G  Loo. 


Coy  becomes  frozen  and  {T*}  as  the  time  instants  when  a 
jump  in  the  value  of  Coy  occurs  (when  Co^  is  unfrozen  3.nd 
becomes  equal  to  Go  again).  The  levels  at  which  freezing 
and  unfreezing  occur  arc  chosen  to  be  different  to  prevent 
repeated  switchings  arbitrarily  closely.  Precisely  stated, 
we  let  <r,„;„(Co(0))  >  <r.  To  =  0,  and  V  1:  >  1 


ti  is  such  that 
Tt  b  such  that 


f  <rm;»(G)(tfc))  =  <r/6 
\  <r,^u(Co(t))  ><r/b  Vr.-i  <<<ri 

r  <’’oa„(Co(Ta))  =  <r/a  Qg\ 

{  <r„:„(G)(<))  <  <r/n  Vt,<t<T^  > 


In  Fig.  1,  the  parameters  a  and  b  are  respectively  equal  to 
2  and  4,  but  other  values  can  be  selected,  provided  that 
1  <  a  <  6  <  oo.  Note  that,  since  G)(0  b  continuous, 
tk  <Tk  <  t)t+x  <  Tfc+i  Vt  >  1  and 


<r„u(G)(0)  >  '^/b  if  Tk-i  <t<l^ 

<r,n;a(Co(<))  <  ah  if  tk<t<Tk 

The  relationship  between  Coy{t)  and  Co(t)  b  described 
by 


Coy{t)  = 


{ 


Cc(i) 

Co(tk) 


if  Tk-1  <t<tk 
if  tk  <t<Tk 


(11) 


so  that  (r„i„(Co^(t))  >  a/b  V  t.  The  following  parameter 
transformation 

Co^  =  Coy  =  e  (12) 


would  then  appear  to  be  adequate  at  this  stage.  However, 
it  can  be  checked  that  the  transformation  (12)  does  not 
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guarantee  property  3.  although  it  guarantees  all  the  oth¬ 
ers  Therefore,  the  following  additional  transformation  is 
necessary. 


3.2  Simple  hysteresis  +  projection 

To  solve  the  problem  of  guaranteeing  property  3,  the  ma¬ 
trix  L  in  the  controller  is  also  transformed,  using  the 
following  parameter  transformation 

e{i)  if  Tk-i  <  t  <  tk  VI:  >  0 
Ui)  =  {  6(1)  +  Pit)P,(i)iP^{t)P,it))-HcT,{i)  -  clit)) 

if  tjfc  <  t  <  Tic  VA:  >  0 

(13) 

where  Pp  is  the  rectangular  matrix  made  of  the  first  p 
columns  of  P.  Since  P  is  symmetric.  P  is  the  matrix 
made  of  the  first  p  rows  of  P.  It  can  be  checked  that, 
with  this  transformation.  Co.  -  Co,-  Further,  the  trans¬ 
formation  is  well-defined  and  all  the  necessary  properties 
are  guaranteed  as  long  as 

3,>0  s.t.  ><  Vi,<t<n  (H) 

In  the  SISO  case,  it  can  be  proved  that  condition  (14) 
is  always  satisfied,  so  that  the  parameter  transformation 
(13)  is  always  well-defined  (it  is  the  transformation  pre¬ 
sented  in  Lozano  et  al  (1990)).  Indeed,  given  the  least- 
squares  estimation  algorithm  (9).  ^(P'V)  =  0  between 
resettings,  from  which  one  can  deduce  that 

(15) 


00^(1)=: 


where  ry  is  the  last  covariance  resetting  time  instant,  i.c. 
ry  <  t  <  ry+i.  Therefore. 


<rB,;o(Cj'(t))  >  <r  —  o'm»x(F’p  (0) 


1^(0)I 


(16) 


In  the  SISO  case,  Pp  is  a  vector.  Based  on  (16).  if  for  some 

t  >  0.  1F,(01  <  <^(1  -  l/“)l^' 

such  that  lCb(t)l  >  3  fc  >  1  such  that  Ta-i  < 

i  <  tg.  Therefore,  condition  (14)  is  always  satisfied. 

Unfortunately,  in  the  MIMO  case,  there  is  no  gu^- 
antee  that  condition  (14)  is  respected  V  t  >  0.  In¬ 
deed,  if  there  is  excitation  in  some  directions  but  not 
others,  <r„-..(Pp(t))  could  tend  to  zero  while  o-„„(Pp(‘)) 
would  not.  Therefore,  one  could  conceive  that,  given  cer¬ 
tain  initial  conditions  and  given  certain  reference  signals, 
lim,_„ir„-,„(Pp(t))  =  0  with  lim._oo  o-„u.(Co(0)  < 

In  that  case,  the  transformation  (13)  would  not  be  well- 
defined  V  t  >  0.  Therefore,  property  2  would  not  be 
guaranteed  for  this  transformation  in  the  MIMO  case. 


3.3  Selective  hysteresis  -f  projection 

To  solve  the  problem  of  the  pseudo-inverse  Pp{Pp  Pp) 
not  being  necessarily  bounded  in  the  MIMO  case,  an  addi¬ 
tional  modification  is  brought  to  the  parameter  transfor¬ 
mation.  Essentially,  the  problem  is  that  if  the  estimated 
matrix  Co  persists  in  being  singular,  despite  the  presence 
of  excitation  in  certain  (but  obviously  not  all)  channels, 
then  the  matrix  Co.  must  be  updated  selectively  in  the 
directions  where  there  is  excitation.  The  major  challenge 
is  to  achieve  this  objective  while  maintaining  the  bound¬ 
edness  of  Co'  and  all  the  necessary  properties. 


We  consider  the  following  transformation 

{e{i)  if  Tfc-i  <  t  <  U  VI:  >  0 

e(t)  4-  p(t)V>(t)E^'(0^r(0{^^{0'^?(0  ~  Cj'(i)) 
if  tk<t<Tk  Vk>0 

(17) 

where  Vp^p'Uf  is  a  lower  rank  approximation  of 
Pp{P^Pp)~'.  Specifically,  we  consider  the  following  par- 
tioned  singular  value  decomposition  of  Pp 

[  Up  U.  fp^  [  Vp  K  ]  =  [  E.  ] 

and  E  =  diag{(ri}  cri>o^i  Vt<j  (18) 


such  that 


Ep  = 


■  iri  0  .  0 

0  <r2  . 

0  .  0  iTNp 


>  (ffATp  +  l  +  6p)I 

>  {cp  + 


E. 


< 


irwp  +  i 


—  1 
0 


(p  +  {p  —  Np  —  l)ip 


<  {(p  +  {p  —  Np  —  l)ip)J 


cp  +  6p  0 
0  Cp 


(19) 


Np  is  the  size  of  Ep  and,  therefore,  Up  €  Vp  € 

5j2pvx^P^  g  gjPK(p-Wp)^  V,  €  The 

constanU  cp  and  6p  are  arbitrarily  chosen,  but  must  sat¬ 
isfy  the  following  conditions 


<rko _ _ 

0  (1  +  (p  —  l)/ap)(iTe*  +  1®(0)1) 

0  <  5p  =  Cplap  with  ap  >  1  (20) 


where  oa-  is  a  known  upper  bound  of  10*1-  Basically,  E< 
contains  the  singular  values  of  Pj  no  greater  than  cp  and 
the  ones  which  are  sufficiently  close  to  them  (by  increment 
«p)  Ep  contains  the  larger  singular  values  such  that 
<r„;„(Ep)  >  +  The  difference  hp  is  found  to 

be  required  to  guarantee  continuity  in  the  proof. 

Finally,  /f?  =  V/Uo’’,  where  V>  is  an  orthonormal  basis 
of  the  subspacc  AT  =  n(Co^  Up  U  J)  of  size  Np  and  Uo  is  an 
orthonormal  basis  of  the  subspace  y  =  Tl{Co  U^Uj)  also 
of  sire  Np.  Therefore,  given  the  properties  of  projections 
(c/.  Definition  2) 

Vj  Vf  =  I  and  VfV^  =  P R(Co^  UpUj)  = 

UqUo  =  I  and  UqUJ  =  Pfc^CaU.u]')  “ 


If  <ra>-.u{P^{t))  >  then  Np  -  P,  and  Px  =  Py  -  {■  If 
o-„:„(Fj(t))  <  fP  then  Np  <P  and  Px  and  Py  are  given 

by 


P;^  =  CojUp{UpCo^CojUp)  'UpCl^ 

Py  =  {I-CoU,{UjcJCoU,)-'UjC^)  (22) 


Indeed,  referring  to  Definition  2,  we  let  A  -  Co,VpU 
and  U  =  [Up  Ut  1,  so  that  B  =  Co^f/p  and  Px  is 
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ho. 


given  by  (22).  This  assumes  that  D  =  CojUp  has  full 
column  rank,  .-.e.,  nonsingular.  Horvever. 

since  a,  is  nonsingular.  {UjCj  Co,  Up)  ‘  is  welWefined 
and  the  number  of  columns  of  Vj  is  equal  to  Np  A  simdar 
derivation  applies  for  Py.  From  (15).  it  can  be  checked 


°  (23) 

where  ry  is  the  last  covariance  resetting  time  instant,  t.c. 
rj  <t<  ry+i.  Therefore,  given  (19)  and  (20) 


<r^UUTCo)  >  <r  -  (cp  +  (P  - 

(24) 

so  that  (UTCTCoU,)-^  is  well-defined  and  the  number  of 
columns  of  Uo  is  indeed  equal  to  Np. 

Comments; 

To  guarantee  some  continuity  properties  and  the 
uniqueness  of  Vj  and  Uo.  the  Gram-Schmidt  orthogo- 
nalization  procedure  with  memory  described  in  the  ap¬ 
pendix  is  applied  to  Pa-  and  Py  to  compute  V>  and 
Uo  respectively.  Using  this  procedure,  we  also  have 
that  if  <r^4Pj(^i))  >  CP  then  Np  =  p,  P?(0  =  I. 

Vp(t}E-^(t)uf(t}  =  Pp(t)(Pp^(0Fe(0)"'.  Pa¬ 

rameter  transformation  (17)  is  simply  identical  to  the  pa¬ 
rameter  transformation  (13). 

It  can  also  be  shown  that  Np  >  1.  Indeed,  if  3  <  >  0 
such  that  Np  =  0,  then  from  (24),  ^ ml  a  (Co)  >  rr/u. 
Therefore,  3  1:  >  0  such  that  Tk-i  <  t  <  ik  and 
6c(i)  =  ®(0-  words,  if  Pp  was  to  be  small  in 

all  directions,  then  it  would  mean  that  there  had  been 
enough  excitation  so  that  Co  would  be  close  to  Co  and  one 
would  be  out  of  the  singularity  region.  Therefore,  Np  >  1, 
<rmin(Ii/>)  >  Cr>  +  */>  and  the  parameter  transformation 
(17)  is  well-defined.  Basically,  the  inverse  (Pj Pp)“*  has 
been  replaced  by  a  lower  order  inverse,  whose  existence 
and  boundedness  can  be  guaranteed. 

Finally,  it  can  be  verified  that  Cg^'  €  Poo-  Indeed,  if 
Tjt-i  <  i  <  tk  oi  i{  tk  <  t  <  Tk  and  <rroi„(P/(t))  >  <p, 
then  C^(t)  =  Cj'^(t)  whose  inverse  b  always  bounded.  If 

tk  <  i  <Tk  and  ^  min  (p/)<cr..then 


Cj;(0  =  Co^'Cf)  3-  t/p(t)tfJ(t)(Co^,(t)P?(f)  -  Co^ (0) 

=  Up{t)u^{t)c^t)R^i)  +  u.{t)uT{t)cT  it) 

(25) 


Therefore,  using  (21)  and  (22), 

clco^  =  UpuTclVfVj'Co,UpUl  +  u.uTc'^CoU,uT 


UjcT,CojUp 


UjCjCoU^ 


where  U  =  ^  Up  Ut  ],so  that  <rm;o(Co.-)  ^  o/6. 

Finally,  given  (17)  and  all  the  previous  definitions,  the 
parameter  transformation  with  hysteresis  will  have  the 
following  form 

fi,(t)  =  e(t)  +  p(0Q(f)  (2'^) 


0  if  Tk-i  <  t  <  c  vfc  >  0 

Ppii)iP^{t)Rpi^)r'^cl,  w  -  W) 

/  ik  <t<Tk  Vfc  >  0 
I  and  o-„;,(Pj(t))  >  Cp  (28) 
Vp{l)Tj\t)U?it)iCl^it)RUt)  -  Co^(t)) 

/  tk<t<Tk  Vfc  >  0 

^  and  <rBun{P^(0)  £ 

Lemma  2  :  Properties  of  the  hysteresis  transfer- 

matioa 

Assuming  that  ij)  €  and  that  the  parameter  estima¬ 
tion  algorithm  is  defined  by  (9),  then 

1.  The  transformation  is  always  uniquely  defined. 

2.  Co”‘  e  Poo. 

3.  Q  e  Poo,  6c  e  Poo,  and  <i>c  G  Poo,  where  =  6c-6*. 

=  eL^nLoo- 

5.  If  lime— oo  ^{0  =  0  then  lim.  — dett)  =  0- 
6  -  Tk-i)  and  (Tk  -  tk)  are  bounded  below  Vfc. 

7.  (Pt)  and  {(*)  arc  finite  sets. 

8.  If  P(t)  and  <l>{i)  converge  to  some  Poo  and  <^oo  tlicn 
^^(<)  converges  to  some  4>cao‘ 

The  proof  is  in  de  Mathelin  (1993). 

Coraments: 

The  fact  that  Q  €  Poo  is  obtained  because,  even  though 
P~'(j)  is  not  necessarily  bounded  as  t  increases,  !Cp’  ex¬ 
ists  and  is  always  bounded  when  Q  yf  0.  In  other  words, 
should  P“’  be  unbounded,  either  Ep'  will  continue  to 
exist  or  the  algorithm  will  come  out  of  the  hysteresis  re¬ 
gion.  The  fourth  property  is  most  important;  it  shows 
that  the  main  property  on  R  in  the  estimation  algorithm 
remains  valid  when  the  estimated  parameter  error  <l>  is 
replaced  by  the  controller  parameter  error  <j>c-  The  fact 
that  the  intervals  (tk  —  Tt-i)  ^nd  (Tk  —  fi)  are  bounded 
below  comes  from  the  normalized  nature  of  the  adapta¬ 
tion  algorithm  and  from  the  separation  of  the  freezing  and 
unfreezmg  levels  o  and  fc.  It  eliminates  the  possibility  of 
having  an  infinite  number  of  jumps  in  a  finite  interval  of 
time.  The  seventh  property  tells  us  that  there  is  a  finite 
number  of  passages  in  the  hysteresis  loop.  After  a  while, 
the  algorithm  will  settle  and  no  more  jumps  will  occur. 
The  property  is  the  consequence  of  the  convergence  of  <6 
(not  necessarily  to  zero).  The  algorithm  could  actually 
settle  inside  the  hysteresis  loop.  In  that  event,  we  will 
see  in  the  following  section  that  the  stabUity  properties  of 
the  adaptive  algorithm  are  preserved.  The  last  property 
is  obtained  because  of  the  difference  Sp  between  Ep  and 
Ek  and  because  of  the  use  of  the  particular  Gram-Schmidt 
orthogonalization  procedure  defined  in  appendix. 

Note  that  from  (27)  and  (28)  with  (25), 

Uf(t)Cl(t)  =  Uf{t)Cl^(t)Rl(t) 

ul(t)cl(i)  =  uT(t)c;'"(t)  +  Ek(i)v^(i)^ 

where  Tk  is  the  last  covariance  resetting  time  instant.  Ba¬ 
sically,  this  particular  parameter  transformation  unfreezes 
Go^  in  the  directions  where  (Cj  —  Co  )  is  sufficiently 
small,  using  the  knowledge  that  there  has  been  sufficient 
excitation  in  those  directions. 
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4  Stability 

If  we  define  the  model  signals,  ^  signals  ^  when 
d  =  0,  then  we  can  define  the  regressor  error,  e^,  as 

e^  =  d  “ 

Theorem  1  -•  Stability  of  the  MRAC  system 
Consider  the  MIMO  MRAC  system  with  the  parameter 
estimation  algorithm  and  the  hysteresis  transformation 
described  previously.  If  the  reference  input  r  €  ioo  and 
is  piecewise  continuous,  then 

.  All  states  of  the  adaptive  system  are  bounded  func¬ 
tions  of  time. 

•  The  output  error  co  =  Vp  -  G 

limi.,oo  eo  =  0. 

«  The  regressor  error  =  tfr  -  V"™  €  Aoo  H  Lj  and 
limt— oo  e^  =  0- 

The  proof  is  an  extension  of  the  SISO  proof  of  Sastry 
Dodson  (1989)  and  can  be  found  in  de  Mathclin  (1993). 

Appendix 

Gram-Schmidt  orthogonalization  with  memory 
Let  W{t)  e  K'”''’  be  a  matrix  of  rank  r  function  of  time 
and  let  h  >  0  be  a  constant  to  be  fixed  later. 

1.  At  time  t  =  0,  apply  tlie  following  procedure. 

kV  ,,,=  f-eiti,  column  of  kV  s.t. 

V'l  =  kk'j.(i)  with  _  jnaxt  11V*|  n,in 

lyO)  ^  (1V<|),=0) 

=  It^^Hh  column  of  kk^e)  s.t. 

r.  =  lV<J,\with  t(^)min 

ky<^)  = 

=  lt(eth  col  of  kV<"-‘>  s.t. 
r.  =  |H^^j)^>)|  =  maXfc|kV<"-*^lt<^’niin 

=  ° 

Then,  the  matrix 

1 

is  an  orthogonal  basis  of  7l(kK(0)),  the  space  of  dimension 
r  spanned  by  the  columns  of  1T(0).  The  order  of  selection 
of  the  columns  of  kK(0),  (A:('b|<=o,  =  1.  -  -  - .  r  is  uniquely 

defined  by  this  procedure.  Therefore,  the  matrix  X(0)  is 
also  uniquely  defined. 

2.  Keep  the  initial  order  of  selection  of  the  columns. 

the  orthogonalization  of  W{t),  i  >  0,  until 

3<  =  <i(kK)>0  31<y<r  31</<P  s.t. 

+ ^  <  1^< 

where  W^°'>  =  VK.  Then,  at  i  =  U{W)  the  procedure  de¬ 
fined  for  i  =  0  is  applied  for  the  computation  of  X(<i),  and 


a  new  order  of  selection  of  the  columns.  (irG)  is 

found. 

3.  Finally,  continue  this  procedure  for  the  orthogonaliza¬ 
tion  of  W{t),  t  >  <i(kK).  The  set  {<k(kK)}  are  the  time 
instants  when  the  order  of  selection  of  the  columns  of  W 
is  changed.  The  matrix  X(0  will  be  uniquely  defined  for 
all  t  >  0. 

The  procedure  is  applied  with  kK  =  leading  to 

X  =  V>,  and  similarly  to  Py,  leading  to  Uq.  The  advan¬ 
tage  of  using  this  Gram-Schmidt  orthogonalization  with 
memory  is  that  the  matrices  Vf  and  Uo  are  uniquely  de¬ 
fined  for  given  matrices  Px  and  Py.  The  constant  h  is 
chosen  sufficiently  small  that  no  vector  Y;  may  become 
equal  to  zero,  while  preventing  the  order  of  selection  of 
the  columns  to  change  an  infinite  number  of  time  during 
a  finite  time  Interval.  Note  that  using  the  Gram-Schmidt 
orthogonalization  procedure  with  memory  for  Px  —  I  or 
Py  =  I  leads  to  Vy  =  /  and  Uo  =  I.  Finally,  it  can  be 
shown  [cj.  dc  Mathelin  (1993))  that  if  h  <  X,  then  the 
Gram-Schmidt  orthogonalization  with  memory  is  always 
well-defined  for  Vf  and  Uq. 
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Pseudo-burst  Phenomenon  in  Ideal  Adaptive 

Systems* 

MARC  BODSONt 

A  phenomenon  is  described  which  is  similar  to  the  well-known  burst 
phenomenon  but  is  observed  in  ideal  systems ^  that  iSj  adaptive  systems 
without  noise  or  unmodeled  dynamics. 


Key  Words — Adaptive  control,  convergence  analysis;  harmonic  analysis;  Lyapunov  methods, 
stability. 


Abstrad — A  phenomenon  is  described  which  is  similar  to 
the  well-known  burst  phenomenon.  With  examples,  it  is 
shown  that  the  responses  of  adaptive  systems  may  exhibit 
large  transients  of  the  error  signal,  separated  by  quiet 
periods.  In  contrast  to  the  burst  phenomenon,  the  responses 
observed  in  this  paper  correspond  to  ideal  systems,  that  is, 
adaptive  systems  without  noise  or  unmodeled  dynamics. 
Also,  in  accordance  with  the  theory,  the  bursts  do  not  occur 
indefinitely.  Two  examples  arc  discussed.  The  first  is  shown 
to  be  very  similar  to  the  burst  phenomenon.  The  second  is 
found  to  be  quite  different,  with  its  origin  being  in  the 
internal  excitation  generated  through  the  interaction  of 
adaptation  and  control.  An  averaging  analysis  is  provided, 
which  highlights  the  uses  and  limitations  of  averaging  for  the 
analysis  of  adaptive  systems. 

1.  INTRODUCTION 

The  burst  phenomenon  in  adaptive  systems  has 
been  reported  by  several  authors  (see  e.g. 
Anderson  (1985)  for  a  clear  exposition  in  the 
context  of  adaptive  identification  and  control 
and  Sethares  et  al.  (1989)  for  an  interesting 
discussion  in  the  context  of  adaptive  hybrids  for 
echo  cancellation).  In  the  presence  of  noise,  the 
outputs  of  adaptive  systems  exhibit  large 
transients  at  certain  time  intervals.  The  bursts 
are  related  to  the  parameter  drift  due  to  the 
noise  and  can  be  prevented  using  update  law 
modifications. 

It  is  perhaps  not  so  well  known  that  a  very 
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similar  phenomenon  can  be  observed  in  idea! 
adaptive  systems,  that  is,  adaptive  systems 
without  noise  or  unmodeled  dynamics.  It  is  the 
purpose  of  this  paper  to  present  and  explain  this 
phenomenon,  which  will  be  called  pseudo-burst 
phenomenon.  We  will  discuss  two  examples.  It 
will  be  shown  that  the  explanation  of  the  first 
example  is  much  related  to  that  of  the  burst 
phenomenon,  although  there  is  no  parameter 
drift  since  there  is  no  noise.  We  will  also  discuss 
another  example,  where  a  single  burst  is 
observed.  This  is  a  more  complicated  example, 
and  we  will  show  that  its  explanation  lies  in  an 
unusual  interaction  between  adaptation  and 
control. 

In  Section  2,  we  review  the  burst  phenomenon 
with  a  simple  example.  In  Section  3,  we  present 
two  simulations  exhibiting  the  pseudo-burst 
phenomenon.  In  Section  4,  we  investigate  in 
detail  the  origin  of  the  bursts  observed  in  the 
simulations  and  explain  the  pseudo-burst  pheno¬ 
menon.  In  Section  5,  we  investigate  the 
application  of  averaging  techniques.  We  con¬ 
clude  the  paper  with  some  observations  about 
the  implications  of  the  results  on  the  design  of 
adaptive  stabilization  and  control  schemes. 

2.  THE  BURST  PHENOMENON 
Since  the  explanation  of  the  pseudo-burst 
phenomenon  is  much  related  to  the  explanation 
for  the  burst  phenomenon,  it  is  useful  to  review 
it  briefly.  An  example  will  help  the  presentation. 
Both  continuous-time  and  discrete-time  ex¬ 
amples  can  be  found,  which  exhibit  parameter 
drift  and  the  burst  phenomenon.  However, 
simpler  examples  can  be  constructed  in  discrete¬ 
time,  and,  therefore,  we  present  such  an 
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example.  Consider  the  plant 

xik+V)  =  a*x{k)  +  b*u{k),  (2.1) 

y(k)=x(k)  (2-2) 

with  the  objective  of  tracking  the  output  of  the 
reference  model 

x„{k  +  1)  =  a^„(A:)  +  b„r(k),  (2.3) 

y^{k)=x„{k)  (2.4) 

where  r{k)  is  the  reference  input.  The  controller 
is  chosen  to  be 

u{k)=  c{^k)r(k)  +  d(Jc)y{k).  (2-5) 

The  closed-loop  transfer  function  matches  the 
reference  model  transfer  function  for  the  fixed 
value  of  the  parameters 


which  are  called  the  nominal  controller  para¬ 
meters.  A  standard  identification  algorithm  is 
the  projection  algorithm  of  Goodwin  and  Sin 
(1984). 

a{k  +  1)  =a(/c) 

i^,{k^-a{k)y{k-\)-b{k)u{^k-\))yi,k-\) 
l+u\k-l)+y\k-l)  ’ 

(2.8) 

bik  +  l)  =  b(k) 

( y(k)  -a(k)y(k  -  I)  -b(k)u{k  -  l))n(k  -  1) 

1 -1- M^(/c  -  1)  + /(^  - 1)  ’ 

(2.9) 

where  a{k),  b(k)  are  the  estimates  of  a*  and  fe* 
at  time  k.  The  controller  parameters  c{k),  d{k) 
are  obtained  from  the  estimates  a(^k),  b{k)  as  if 
they  were  the  true  plnnt  paranieters,  using 
(2.6)-(2.7).  A  modification  has  to  be  introduced 
to  avoid  division  by  zero  if  b{k)  =  0,  but  this  is 
immaterial  to  the  present  discussion.  Further,  it 
is  known  that  a{k)—*-a*,  b{k)—*b  and, 

therefore,  c(k)-^c*,  d{k)-^d\  if  the  plant 
input  u{k)  is  sufficiently  rich.  Here,  this  means 
that  the  input  should  have  at  least  one  sinusoidal 
component.  It  is  also  known  that,  for  this 
scheme,  the  condition  on  the  plant  input  u{Jd)  is 
equivalent  to  the  same  condition  on  the 
reference  input  r(/c). 

When  the  input  is  not  sufficiently  rich,  the 
parameters  do  not  necessarily  converge,  even 
though  the  output  error  yik)—y,„{k)  tends  to 
zero  as  k-^°=.  This  is  not  problematic,  unless 
measurement  noise  is  added  to  the  adaptive 


(2.6) 

(2.7) 


system,  so  that  (2.2)  is  replaced  by 

yik)  =  xik)  +  nik).  (2.10) 

Then,  the  adaptive  parameters  move  across  the 
parameter  space  and,  due  to  the  multiplicative 
nature  of  the  update  laws  (2.8)-(2.9),  this 
movement  often  manifests  itself  in  the  form  of  a 
parameter  drift.  The  origin  of  the  problem  is 
that  the  error 

eik)=y(k)  -  a{k)y{k  -  1)  -  b(k)u{k  -  1) 

(2.11) 

which  is  used  for  adaptation  consists  mostly  of 
measurement  noise,  once  the  initial  transient  has 
passed.  The  parameters  a{k),  b(k)  are  used  to 
calculate  the  controller  parameters  c{k),  d(k) 
which,  in  turn,  are  used  to  control  the  plant.  If 
the  error  (2.11)  is  small,  the  parameters  move 
slowly  and  the  stability  of  the  closed-loop  plant 
is  determined  by  the  condition  that  the 
magnitude  of  the  closed-loop  pole  be  less  than 
one,  i.e. 


\a*  +  b’d{k)\  = 


6* 

b{k) 


{a„.  -a{k)) 


<  1. 


(2.12) 


When  the  parameters  drift,  they  may  reach  the 
region  where  (2.12)  is  violated,  in  which  case  the 
inner  control  loop  becomes  unstable  and  a  large 
error  burst  occurs.  Since  the  output  signal 
becomes  suddenly  much  larger  than  the  noise, 
sufficient  excitation  is  present  and  brings  the 
parameters  back  to  the  stability  region.  But  it  is 
only  a  matter  of  time  before  the  adaptive 
parameters  drift  again  and  reach  the  instability 
region,  leading  to  another  burst. 

Figures  1  and  2  show  simulations  of  the 
adaptive  system  with:  a*  =0.819,  6*  =0.181, 
a„  =  0.670,  b„  =  0. 165.  These  values  correspond 
to  the  step-response  equivalents  of  a  plant 
1/(5 -1- 1)  and  a  reference  model  l/(s-l-2) 
sampled  at  200  msec.  The  noise  is  the  sampled 
equivalent  of  a  sinusoid  of  amplitude  0.05  and 
frequency  5radsec“'  and  the  reference  input 
r(^lc)  =  0.  Figure  1  shows  the  response  of  x{k), 
which  exhibits  the  large  transients  characteristic 
of  the  burst  phenomenon.  Figure  2  shows  the 
location  of  the  closed-loop  pole 

t„{k)  =  a*  +  b*d(k).  (2.13) 

It  can  be  seen  that  the  pole  drifts  across  the 
stability  border  until  the  output  becomes  large 
enough  to  bring  the  estimates  back  in  the 
stability  region.  The  phenomenon  repeats  itself 
indefinitely. 

The  phenomenon  of  instability  due  to 
parameter  drift  was  also  reported  by  Rohrs  et  al. 
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Fig.  1-  Burst  phenomenon,  x{k). 


(1982)  and  discussed  in  detail  by  Astrdm  (1984). 
The  major  difference  between  the  instabilities 
found  by  Rohrs  et  al.,  and  the  burst  phenome¬ 
non  described  here  is  that,  in  Rohrs  et  al.,  a 
continuous-time  example  is  considered  and  it 
also  includes  unmodelcd  dynamics.  In  both  cases 
however,  the  problem  can  be  avoided  by  using 
one  of  the  several  update  law  modifications 
proposed  in  recent  years.  A  dcadzonc  stops 
adaptation  when  the  error  e{k)  is  small  enough 
to  consist  mostly  of  noise.  This  modification 
assumes  that  a  bound  on  the  amplitude  of  the 
noise  exists  and  is  known.  More  sophisticated 
techniques  have  also  been  proposed  (see 
loannou  and  Sun  (1988)  for  a  review  and  an 
interesting  comparison  of  their  properties). 


3.  THE  PSEUDO-BURST  PHENOMENON 
The  adaptive  system  investigated  in  this 
section  is  a  model  reference  adaptive  control 
scheme  originally  proposed  by  Parks  (1966)  and 
extended  by  Narendra  and  Valavani  (1978).  We 
consider  an  nth  order  continuous-time  system 
with  transfer  function 


Hs)  = 


lis)  ’ 


(3.1) 


where  it  is  assumed  that  >  0,  P(s)  is  minimum 
phase  and  the  relative  degree  deg<3p(s)- 
degnp(5)  =  n-m  =  l.  The  objective  is  to  design 
a  controller  to  match  the  output  of  a  reference 
model  with  transfer  function 


Mis)  = 


k„n„{s) 


(3.2) 


where  k„  >  0,  M(s)  is  stable  and  is  minimum 
phase,  Acgi„{s)  =  dcg3.p{s)  and  deg/z„(s)  = 
degnp(s).  Further,  for  the  stability  of  the 
adaptive  rule,  the  reference  model  is  assumed  to 
be  strictly  positive  real. 

We  restrict  ourselves  to  the  case  when  the 
plant  is  of  order  2  (n  =  2,  m  =  1).  The  controller 
is  then  given  by 

Li{t)  =  +  c,(r)iv,(r)  +  d„(r)yp(t) 

+  d^{^)w2{i),  (3.3) 


where  «(/)  and  yp{()  are  the  input  and  output  of 
the  plant  and  r{t)  is  the  reference  input.  The 
signals  «’,(/)  and  W2(/)  are  given  by 

vv,(t)  =  —  Aiv,(r) -I- «(t),  H',(0)=0,  (3.4) 

W2it)  = -kw2it)+yp{0,  n'2(0)  =  0-  (3-5) 

with  A>0  such  that  n„,(s)=5  +  A.  The  output 
error  is  given  by 

eo(.0=yp(.0-ym{0,  (3-6) 

where  y„(t)  is  the  output  of  the  reference  model 
with  input  r(r).  The  adaptive  parameters  are 
updated  according  to 


Co  =  -gear. 

(3.7) 

C|  =  -geaWj, 

(3.8) 

do  =  -geoyp. 

(3.9) 

d,  =  -geoW2. 

(3.10) 

where  g  >  0  is  called  the  adaptation  gain. 

It  can  be  shown  that  there  exist  unique 
parameter  values  Cq,  cf,  do  and  d*  such  that  the 
closed-loop  transfer  function  matches  the  re¬ 
ference  model  transfer  function.  However,  the 
stability  of  the  adaptive  scheme  is  guaranteed 
even  if  the  parameters  do  not  converge  to  their 
nominal  values.  In  particular,  it  can  be  shown, 
through  a  Lyapunov  analysis,  that  all  state 
trajectories  remain  bounded  and  that  eo(0^0 
>.oo.  Further,  the  integral  of  el  from  t=0  to 
(  =  00  is  bounded  (cf.  Narendra  and  Valavani 
(1978),  Sastry  and  Bodson  (1989)).  A  remark¬ 
able  result  is  that  the  plant  is  not  required  to  be 
stable;  the  scheme  can  be  used  to  stabilize  an 
arbitrary  unstable  system  provided  that  the  plant 
order  is  known,  its  gain  k^  is  positive,  its  relative 
degree  is  1  and  it  is  minimum  phase.  All  states 


AUTO  29:4-1 


932 


M.  Bodson 


remain  bounded,  whatever  the  system  poles  are 
and  whatever  the  initial  conditions  are — 
although  the  bounds  themselves  depend  on  such 
parameters. 

We  call  the  system  (3.1)-(3.10)  the  ideal 
adaptive  system,  meaning  that  all  the  assump¬ 
tions  on  which  the  scheme  is  based  are  satisfied 
and,  in  particular,  no  noise  is  present.  Note  that 
the  analysis  results  mentioned  above  preclude 
the  possibility  of  a  burst  phenomenon  for  the 
ideal  system,  since  Cq(/)— >^0  as  However, 

we  will  see  in  the  following  examples  that,  within 
the  results  of  the  theory,  some  rather  unex¬ 
pected  phenomena  can  be  observed. 

Example  1.  We  first  consider  simulations  for  the 
adaptive  system  when 


^  "f"  3 

(3.11) 

^^^^“(5  +  1)(3  +  5)' 

(3.12) 

The  input  r  =  2  sin  {(),  the  adaptation  gain  g  =  1 
and  all  initial  conditions  are  set  to  zero.  The 
parameter  A  =  3  (following  from  (3.12)  since 
n„{s)=s  +  A).  Figures  3  and  4  show  the  output 
error  response  Cq  over  250  and  1500  sec, 
respectively.  The  responses  show  several  bursts, 
very  reminiscent  of  the  bursts  observed  in 
Section  2.  However,  the  output  error  eventually 
converges  to  zero,  consistently  with  the  theoreti¬ 
cal  analysis  which  guarantees  that  eo(0“^0  as 
r— >oo  (although  not  at  all  in  a  monotonic 
manner!).  Figures  5  and  6  show  the  responses  of 
the  adaptive  parameters  Cq  and  d^,  respectively. 
The  responses  of  c,  and  d,  are  similar  (except 
that  the  jumps  at  the  time  of  the  bursts  are 
smaller^)  and  are  omitted  for  brevity.  Even- 


Fig.  3.  Example  1,  e„(()  over  250  sec. 


t  By  virtue  ot  the  Lyapunov  analysis,  all  the  signals  in  the 
adaptive  system  arc  bounded,  as  well  as  their  derivatives. 
Tire  “jumps"  observed  in  the  responses  are  not  discon¬ 
tinuities  but  rapid  variations  o(  signals  with  bounded 
derivatives. 


FiG.  4.  Example  1,  over  1500  sec. 


tually,  the  parameters  converge  to  some  values 
which  are  not  the  nominal  values  (those  are 
found  to  be  Co  =  l,  c*  =  0,  do  =  — 8,  and 
d*  =  20).  It  can  be  checked  that  the  values  are 
such  that  the  closed-loop  transfer  function  is 
stable  and  matches  the  reference  model  transfer 
function  at  <y  =  1  rad  sec  .  This  condition  is 
insufficient  to  guarantee  parameter  convergence 
to  the  nominal  values,  but  is  sufficient  to 
guarantee  convergence  of  the  output  error  to 
zero. 

The  bursts  observed  in  this  simulation  look 
very  similar  to  the  bursts  of  Section  2  and  we  will 
see  that  they  also  have  a  similar  explanation. 
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Fig.  7.  Example  2,  e„(r)  over  200 sec. 


Fdowevcr,  their  origin  is  not  in  parameter  drift 
caused  by  noise  and  the  bursts  cannot  continue 
indefinitely  by  virtue  of  the  analysis  results. 
Therefore,  we  call  this  a  pseudo-burst 
phenomenon. 

Example  2.  In  the  second  simulation,  the 
adaptive  control  system  is  identical  but  the  plant 
is  replaced  by 

(3.13) 

Figures  7  and  8  show  the  output  error  response 
over  200  and  1000  sec,  respectively.  Now  the 
response  exhibits  a  single  burst:  after  130  sec, 
the  error  reaches  very  small  values  and  remains 
close  to  zero  for  approximately  20  sec,  before 
taking  large  values  again.  Again,  in  accordance 
with  the  theoretical  results,  the  output  error 
eventually  converges  to  zero  and  all  the  states 
are  bounded. 

Figures  9  and  10  show  the  responses  of  the 
adaptive  parameters  Co  and  Ci,  respectively.  In 
this  case,  we  omitted  the  responses  of  and  d^, 
which  are  similar  to  those  of  Co  and  c, 
(respectively).  As  the  error  reaches  zero  after 


Fig.  8.  Example  2.  c„(t)  over  1000 see. 


130  sec,  the  parameters  seem  to  converge  to 
some  values  before  moving  again.  Their  final 
values  are  not  equal  to  the  nominal  values,  equal 
to  Co  =  1,  cT  =  2,  da  =  -6,  and  d*  -  14  in  this 
example.  The  convergence  of  all  four  parameters 
would  require  at  least  two  sinusoids.  Again,  it 
can  be  checked  that  the  parameters  converge  to 
values  such  that  the  closed-loop  transfer  function 
is  stable  and  matches  the  reference  model 
transfer  function  at  £u  =  1  rad  sec”'.  For  different 
initial  conditions,  different  parameter  values 
would  eventually  be  reached,  with  only  a 
constraint  to  tie  on  a  two-dimensional  subspace 
of  1R^ 

The  (unstable)  plants  of  Example  1  and 
Example  2  are  both  stabilized  by  the  same 
adaptive  control  schemes,  despite  completely 
different  dynamics.  However,  the  road  to 
stability  is  not  as  smooth  as  expected.  While  the 
second  example  exhibits  a  burst  similar  to  the 
bursts  of  the  first  example,  we  will  see  that  its 
origin  is  quite  different  and  raises  interesting 
questions. 

3. 1-  Practical  relevance  of  the  examples 

As  the  reader  may  wonder  whether  these 
examples  were  carefully  contrived,  we  would 
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like  to  indicate  how  they  were  obtained.  The 
examples  were  created  to  illustrate  the  stabi¬ 
lization  properties  of  adaptive  systems,  as  part  of 
a  homework  given  in  a  graduate  course  on 
adaptive  control  taught  in  the  Fall  of  1989.  At 
the  time,  integer  values  were  chosen  for 
simplicity  and  no  adjustments  have  been  made 
since  to  the  parameters.  The  fact  that  these 
examples  were  obtained  at  the  first  trial  with 
unstable  second  order  systems  may  be  an 
indication  that  the  phenomenon  is  quite 
common.  In  fact,  after  this  paper  was  submitted 
for  publication,  Rohrs  (1990)  presented  a 
discrete-time  example  similar  to  Example  1  of 
this  paper.  The  paper  of  Hsu  and  Costa  (1987) 
also  presents  an  example  of  burst  phenomenon 
for  an  ideal  adaptive  system.  However,  the 
algorithm  under  consideration  (with  <t- 
modification)  does  not  guarantee  Lyapunov 
stability  of  the  adaptive  system  or  convergence 
of  the  output  error  to  zero.  The  phenomenon 
observed  in  the  paper  of  Hsu  and  Costa  is 
comparable  to  the  burst  phenomenon,  where 
persistent  bursting  occurs  because  of  parameter 
drift  (in  this  case,  the  o-modification  acts  like  a 
disturbance). 

The  responses  observed  in  the  simulations  are 
clearly  undesirable  in  engineering  applications. 
For  example,  adaptive  stabilization  algorithms 
have  potential  application  to  the  design  of 
self-repairing  flight  control  systems,  a  subject  of 
recent  interest  in  the  U.S.A.  (cf.  Chandler 
(1984)).  Bursts  and  slow  convergence  would  be 
highly  undesirable  in  such  an  application.  At 
issue  is  the  transient  response  of  the  adaptive 
systems.  Researchers  are  well  aware  of  this 
problem,  but  few  useful  results  have  been 
obtained.  Several  papers  have  derived  conver¬ 
gence  bounds  based  on  persistency  of  excitation 
conditions  (cf.  Kreisselmeier  and  Joos  (1982); 
Cannetti  and  Espana  (1989),  Zang  and  Bitmead 
(1990)).  These  conditions  are  not  satisfied  here. 


FlC.  11.  Example  1,  Lyapunov  function  v(/). 


Fig.  12.  Example  2,  Lyapunov  function  v(t). 


Even  so,  the  simulations  do  not  contradict  the 
fundamental  results  of  the  Lyapunov  analysis 
(which  is  valid  without  the  persistency  of 
excitation  condition,  and  is  also  of  the  source  of 
the  parameter  convergence  results  when  the 
condition  is  satisfied).  To  illustrate  this  point, 
Figs  11  and  12  show  plots  of  Lyapunov  functions 
for  the  systems  of  Examples  1  and  2, 
respectively.  The  function  is  indeed  found  to  be 
monotonically  decreasing.  The  simulations  do 
not  contradict  the  results  of  the  available  theory, 
but  rather  show  the  surprising  responses  that  are 
possible  within  its  scope. 

3.2.  Comment  about  the  simulations 

Another  comment  is  in  order  about  the 
simulations  themselves.  The  simulations  were 
carried  out  using  the  simulation  package 
SIMNON.  We  found  that  the  responses  were 
quite  sensitive  to  the  error  tolerance  used  by  the 
integration  algorithm  and  set  by  the  user  (cf. 
Elmqvist  et  al.  (1990)).  Decreasing  the  value 
from  the  default  of  10“^,  we  found  visible 
differences  in  the  responses  but  they  remained 
nearly  invariant  for  errors  less  than  or  equal  to 
10“*.  The  plots  shown  above  were  obtained  for 
the  error  bound  set  to  10"®.  The  integration 
algorithm  was  the  Dormand-Prince  4/5  algo¬ 
rithm  and  we  checked  that  the  responses  were 
similar  to  those  obtained  with  a  Runge-Kutta- 
Fehlberg  4/5  algorithm.  It  should  be  noted, 
however,  that  only  the  locations  and  sizes  of  the 
bursts  were  affected  by  the  simulation  para¬ 
meters.  The  pseudo-burst  phenomenon  was 
observed  for  all  values  of  the  error  tolerance  that 
were  tried,  from  10“^  to  10“®.  There  again,  the 
phenomenon  itself  was  found  to  be  “robust”  and 
not  a  singular  event. 
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4.  ANALYSIS  OF  THE  EXAMPLES 

Example  1.  For  the  analysis  of  the  examples,  we 
introduce  the  following  terminology.  We  call 
inner  loop,  the  feedback  loop  consisting  of  the 
plant  (3.1)  with  the  feedback  law  (3.3).  We  will 

call  outer  loop  or  adaptation  loop,  the  additional 
loop  consisting  of  the  update  laws  (3.7)— (3.10). 
We  now  proceed  with  the  analysis  of  the 
pseudo-burst  phenomenon  observed  in  the  first 
example.  The  explanation  is  similar  to  the 
explanation  of  the  burst  phenomenon  in  Section 
2;  the  parameters  evo've  along  trajectories  which 
repeatedly  cross  the  stability  boundary  of  the 
inner  loop.  However,  this  occurs  in  the  normal 
evolution  of  the  adaptive  parameters,  and  it  is 
not  caused  by  parameter  drift  due  to  measure¬ 
ment  noise. 

Assuming  that  the  parameters  vary  slowly,  the 
stability  of  the  inner  loop  is  determined  by  the 
closed-loop  characteristic  polynomial  for  frozen 
parameters,  which  is  given  here  by 

+  (XiS^  +  o'l-t  +  a'(,  =  -F  (1  -  C|  -  du)s^ 

+  (—5  -F  2C|  —  6d(i  —  di)s 

+  (3-c,-3d, -9do).  (4.1) 

Note  that  a  third  order  polynomial  such  as  (4.1) 
is  stable  if  and  only  if  or,,  >0,  otj  >  0  and  to  >  0, 
where  (cf.  Kamen  (1990)) 

to=a^a2-ao■  (4-2) 

The  variable  /<,  has  a  similar  role  as  the  variable 
of  the  same  name  in  Section  2.  In  particular, 
to  =  0  if  and  only  if  the  polynomial  has  a  pair  of 
roots  on  the  imaginary  axis. 

We  proceed  with  our  analysis  by  inspecting 
more  closely  the  first  burst.  Figure  13  shows  the 
detail  of  the  output  error  Cq  between  150  and 
200  sec,  while  Fig.  14  shows  the  evolution  of  to- 
These  responses  are  very  reminiscent  of  Figs  1 
and  2.  Shortly  before  the  burst,  tg  becomes 
negative,  indicating  that  two  poles  of  the  inner 
loop  crossed  the  imaginary  axis.  After  the  burst. 


Fig.  14.  Example  1,  /<,(/)  between  150  and  200  sec. 


to  becomes  positive  again,  an  indication  that  the 
burst  has  had  a  stabilizing  effect  (the  other 
variables  o',,  and  0-2  were  found  to  be  positive 
across  the  burst). 

A  further  test  consists  in  calculating  the  zeros 
of  the  polynomial  (4.1)  for  the  values  of  the 
parameters  immediately  before  and  after  the 
burst.  These  are  given  by 

t  =  168  sec  (before);  Cq  =  1.64,  c,  =  5.77, 
do  =-5.06,  d,=2.40, 

=  -1.01,  J2.3  =  0.36  ±  5.92/, 
t  =  178  sec  (after);  Cg  =  2.18,  c,  =  5.31, 
do  =-6.80,  d,  =2.52, 

=  -1.21,  52.3= -0.64  ±6.48/. 

Clearly,  the  burst  is  accompanied  by  the  crossing 
of  a  pair  of  complex  roots  from  the  right  to  the 
left  half  plane.  The  detail  of  the  burst  in  Fig.  13 
further  highlights  this  fact,  as  the  period  of  the 
oscillation  in  the  burst  is  approximately  Isec 
which  corresponds  to  the  natural  frequency  of 
the  complex  roots. 

In  this  example,  the  parameters  repeatedly 
cross  the  stability/instability  boundary.  Indeed, 
there  is  no  property  of  the  algorithm  that  would 
prevent  this  from  happening.  In  fact,  the  path 
followed  by  the  parameters  is  known  to  be 
usually  curved  or  spiraled  from  the  initial 
conditions  to  the  nominal  parameter  values  (cf. 
the  circular  arcs  of  Astrdm  (1984),  and  the 
averaging  analysis  of  Sastry  and  Bodson  (1989)). 

Example  2.  Given  the  insight  gained  from 
Example  1,  we  may  be  tempted  by  the  following 
explanation;  the  adaptive  parameters  converge 
to  values  such  that  model  matching  is  nearly 
achieved  after  130  sec,  but  these  values  make  the 
inner  loop  unstable  and  a  burst  results,  after 
which  the  parameters  converge  to  values  which 
guarantee  the  inner  loop  stability.  Testing  inner 
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loop  stability  can  be  achieved  as  for  Example  1, 
by  testing  the  closed-loop  characteristic  polyno¬ 
mial,  which  is  now  given  by 

+  a2S^  +  nriS  -f-  OTo  =  +  (3  — 

+  (1  —  4do  —  di)s  -1-  (3  —  c,  —  3do  —  d,).  (4.3) 

However,  calculation  of  the  variables  ao,  qtz  and 
to  throughout  the  simulation  reveals  that  these 
quantities  remain  positive  for  all  time  (except  for 
a  brief  initial  transient).  In  other  words,  the 
inner  loop  remains  stable  throughout  the 
response  and  burst  is  not  due  to  an  instability  of 
the  inner  loop.  This  means  that  the  burst  of  the 
second  example  cannot  be  explained  in  the  same 
manner  as  the  first  example. 

Since  the  inner  loop  is  not  responsible  for  the 
instability,  its  origin  can  only  be  in  the  adaptive 
part  of  the  controller  or  in  the  interactions 
between  the  inner  and  outer  loops.  The  question 
is  therefore  that  of  the  outer  loop  stability,  or 
adaptation  stability.  A  useful  way  to  address 
such  an  issue  is  by  means  of  an  averaging 
analysis  (cf.  Anderson  et  al.  (1986);  Bodson 
(1988,  1989);  Sastry  and  Bodson  (1989)). 

Averaging  allows  us  to  approximate  the 
time-varying  differential  equation  describing  the 
adaptive  system  by  a  time-invariant  differential 
equation.  The  approximation  of  the  averaged 
system  is  valid  if  g  is  sufficiently  small.  We  will 
follow  this  approach  in  the  next  section,  but  let 
us  just  say  for  now  that  while  averaging  indeed 
predicts  the  existence  of  an  equilibrium  point 
(that  is  reached  after  130  sec),  it  also  fails  to 
predict  the  burst;  the  burst  is  eliminated  for 
small  values  of  the  gain  g.  However,  the 
example  has  another  curious  property;  if  the 
adaptation  gain  is  made  larger,  the  burst  also 
disappears.  Figure  15  shows  simulations  where 
g  =  1  for  the  first  130  sec,  then  g  =  0. 1  for  the 
next  70  sec  (this  was  done  to  eliminate  the 


Fig.  15.  Example  2,  1=0-^  130 sec,  g=0.1 

for  !  =  130-^200  see,  and  (a)  5  =  1  for  1  =  200 3(X)  sec.  (b) 
g  =  0.5  and  j  =  2  (superimposed)  for  r  =  200— >  300 sec. 


Fig.  16.  Example  2,  e„(f)  between  75  and  175  sec. 


possible  effect  of  transients)  and,  for  the  last 
lOOsec,  three  cases  are  shown,  that  is,  g  =  0.5, 
1,  2.  The  instability  arises  quickly  when  g  =  1 
while  Co  remains  negligible  in  the  other  cases.  In 
reality,  a  longer  simulation  reveals  that  a  burst 
arises  when  g  =  2,  but  it  occurs  later  (after 
400  sec)  and  is  also  smaller.  If  g  is  switched  to 
10  instead  of  2,  a  still  smaller  burst  occurs.  In 
other  words,  there  is  a  sort  of  resonance  for 
g  =  1- 

To  gain  some  insight  into  this  phenomenon, 
we  explore  the  responses,  of  the  adaptive  system 
around  t  =  130  sec.  Figures  16  and  17  show  Cq 
and  Co,  respectively,  between  75  and  175  sec. 
Recall  that  the  reference  input  has  a  single 
frequency  component  at  at  =  1  rad  sec  '.  The 
frequency  is  indeed  observed  on  e„  before 
130  sec,  together  with  a  third  harmonic.  On  the 
other  hand,  the  gain  Co  has  a  slowly  moving 
component  and  a  large  second  harmonic.  This  is 
easy  to  explain;  the  input  u  has  a  term  CoC .  Since 
Co  has  a  second  harmonic,  a  third  harmonic  is 
introduced  at  the  input  of  the  plant  and  is 
reflected  at  the  output.  The  update  law  for  Cq  is 
Co  =  -gcoc,  so  that  the  correlation  of  the  first 
and  third  harmonics  of  Co  the  first  harmonic 
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Fig.  17.  Example  2,  c„(f)  between  75  and  175  sec. 
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or  r  creates  the  large  second  harmonic  of  Cq.  The 
puzzling  fact  is  what  happens  after  130  sec. 
Then,  for  a  brief  while,  consists  of  a  growing 
oscillation  of  a  pure  second  harmonic,  and  Cq 
consists  of  a  first  and  third  harmonic.  The 

situation  is  comparable  to  the  situation  before 
130  sec,  except  that  Cq  and  Cq  have  been 
interchanged!  The  output  error  contains  a 
second  harmonic  that  is  not  present  in  the 
reference  input  and  that  is  due  to  the 
fluctuations  of  the  adaptive  parameters.  We  will 
show  that  it  is  this  interaction  between 
adaptation  and  control  that  is  responsible  for  the 
unstable  oscillation  and  for  the  burst. 

We  now  provide  an  approximate  analysis  of 
the  system,  based  on  the  observations  made 
about  the  frequency  content  of  the  signals.  The 
analysis  is  unconventional  and  exploits  modula¬ 
tion  properties  of  the  signals.  Several  ap¬ 
proximations  are  required  for  this  analysis,  but  it 
should  be  remembered  that  standard  methods  do 
not  provide  any  help,  while  the  analysis  given 
here  leads  to  a  reasonable  prediction  of  the 
dynamic  behavior  observed  in  the  adaptive 
system. 

For  r  =  130  sec,  the  parameters  are  approxim¬ 
ately  constant,  with  values 

c,r  ==0.097,  cr  =  -0.41, 

^fo"  =  -0.62.  dr -1.53. 

For  these  values  the  closed-loop  transfer 
function  is 


0.097(5  -t-  1)(5  -t-  3) 

~  ?  + 4.025" -1-  1.935  +3.72 


(4.5) 


It  can  be  verified  that  for  s—jco  and 
(0=1  rad  sec“  ‘ 

F,(/)  =  0.32  -  0.31/  =0.35  -0.27/  =  M(/), 

(4.6) 

so  that  the  transfer  functions  are  nearly  matched 
at  the  frequency  of  excitation.  The  transfer 
functions  are  not  matched  for  all  5,  of  course, 
but  the  closed-loop  transfer  function  is  stable, 
with  the  closed-loop  poles  being  given  by 

5,  =  -3.77,  52.3= -0.13  ±0.99/.  (4.7) 

Note  that  while  the  transfer  function  is  stable,  it 
has  a  pair  of  weakly  damped  complex  poles.  The 
explanation  of  the  burst  that  we  propose  is  that 
the  adaptation  of  c„  acts  approximately  like  an 
integral  feedback  term  (with  a  gain  proportional 
to  g)  which  destabilizes  the  system.  It  can  indeed 
be  checked,  by  a  root-locus  argument,  that  the 
system  given  by  (4.5)  with  integral  feedback  has 
two  unstable  complex  roots  for  some  range  of 
the  feedback  gain.  To  show  that  the  adaptation 


of  Co  acts  like  an  integral  feedback,  assume  that 
a  small  sinusoidal  perturbation  is  added  to  the 
input  u.  Then,  the  corresponding  perturbation 
on  the  output  yp  is  given  by 

where  l/c^Pd  is  given  by  (4.4)-(4.5).  This  term 
is  integrated  in  the  adaptation  through  Co  = 
-gcor  and  is  injected  back  to  the  input  through 
(3.3).  This  feedback  is  illustrated  in  Fig.  18. 
Note  that  the  fluctuations  of  the  other 
parameters  d^),  Ci,  and  dx  have  been  neglected 
for  the  sake  of  this  argument.  The  system 
represented  in  Fig.  18  is  not  linear  time- 
invariant.  However,  the  feedback  path  is  similar 
to  a  modulation/demodulation  system,  and  this 
w'ill  allow  us  to  pursue  an  approximate  analysis 
in  a  linear  time-invariant  framework.  Recall  that 
r  =  2  sin  (t),  so  that  for  Sy,,  =  sin  (2t) 

6(e„r)  =  cos(t)-cos(3r),  (4.9) 

<5co= -g(sin  (/)  -  3  sin  (3r)),  (4.10) 

and 

Su  =g5  cos  (2/)  +  other  harmonics.  (4.11) 

In  other  words,  around  &i  =  2radsec  ',  the 
feedback  can  be  approximated  by  an  integral 
feedback  {S/3){g/s).  It  can  be  checked  that  the 
transfer  function  l/coPc\{^)  with  feedback 
(8/3)(g/5)  has  poles 

5,  =  -3.68,  52.3  =  0.14  ±1.86/,  ^4=-0-62. 

(4.12) 

Therefore,  the  integral  feedback  introduced  by 
the  adaptation  of  c„  destabilizes  the  system  by 
moving  the  complex  poles  of  (4.7)  to  the  right 
half  plane.  The  natural  frequency  of  the 
oscillation  is  about  2  rad  sec“'  which  corresponds 
to  the  oscillation  observed  on  Fig.  16. 

This  explanation  offers  some  intriguing  im¬ 
plications.  Before  and  after  the  burst,  the 
spectrum  of  the  input  contains  frequencies  not 
present  in  the  reference  input  and  introduced  by 
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the  adaptation.  The  effect  can  be  looked  at  from 
two  opposite  point  of  views.  In  one  way,  it  is  a 
destabilizing  effect  which  prevents  the  adaptive 
system  from  stabilizing  at  the  equilibrium  for 
t  =  130  sec.  From  another  pont  of  view,  one  may 
say  that  it  provides  additional  internal  excitation 
which  make  the  parameters  converge  closer  to 
their  nominal  values.  The  parameters  eventually 
converge  to  values  such  that  the  closed-loop 
poles  are  given  by 

5,.2=-0.22±0.98y,  J3  =  -4.51.  (4.13) 

The  closed-loop  poles  are  better  damped  after 
the  burst  than  before  (see  (4.12))  which  may  be 
considered  a  stabilizing  effect.  In  either  case, 
however,  the  poles  remain  far  from  the  model 
reference  poles. 


5.  AVERAGING  ANALYSIS 
Averaging  has  proved  to  be  a  very  useful 
technique  for  the  analysis  of  adaptive  systems. 
Unfortunately,  averaging  docs  not  provide  much 
help  in  explaining  the  examples  of  this  paper. 
On  the  contrary,  the  examples  provide  interest¬ 
ing  insights  into  the  limitations  of  averaging. 

The  averaged  system  corresponding  to  the 
adaptive  system  described  in  Section  2  was 
derived  in  Bodson  et  at.  (1986)  (also  available  in 
Sastry  and  Bodson  (1989)).  We  first  define  the 
parameter  error  to  be 

c,  -  cl  do  -  dl  d,  -  dr).  (5.1) 

The  averaged  system  is  given  by  (see  Sastry  and 
Bodson  (1989)) 


where 
A 


2kcI  L 


^av  ^■^av(^av)^av> 

Xm(y<^)'^ 


(5.2) 


X  +  B'^{<p))Mijo})SXd(o),  (5.3) 


where  S^dco)  is  the  spectrum  of  the  reference 
input,  x^(jco)  denotes  the  closed-loop  charac¬ 
teristic  polynomial  evaluated  at  jo,  a  function  of 
the  parameter  error  <p.  Xm{jo)  denotes  the  same 
polynomial  for  <^  =  0.  The  other  matrices  arc 
given  by  equations  (5.4)  and  (5.5). 


l  +  B{<p)  = 


1 

1 - 

Co 

4>i 

0  14-^ 

Co 


0 

0 


1  1  \ 


'*7 

^0 


1  + 


<pl 


^0 


0  / 


(5.4) 


1 


\ 


s  +  X 


\  j  -E  A 


P  ^(s)M(s) 
M(s) 

1 

M(_s) 


(5.5) 


In  the  presence  of  a  single  sinusoidal  component 
at  the  input 

r  =  roSin{oot),  (5.6) 


(5.2)  reduces  to 


AU<P)  = 


d 

2co* 


Am(/t^o) 

X<pij(^o) 


(I  +  B(<p)) 


X  Re  [Ht^iioo)Hl^rUoo)M{joo)] 


x(l  +  B^i<P)). 


(5.7) 


The  averaging  theory  is  based  on  the  assumption 
that  the  closed-loop  polynomial  is  stable  along 
the  trajectories  under  consideration.  However, 
since  the  expression  (5.7)  leads  to  a  rational 
(matrix)  function  of  <p,  the  averaged  system  can 
be  defined  by  continuation  for  all  <p  except  those 
values  corresponding  to  poles  of  the  closed-loop 
system  at  j  =  joo. 

Figure  19  shows  the  responses  of  Cq  with  in 
the  case  of  the  first  example.  The  simulations  for 
the  averaged  system  started  at  t  —  3  sec  with  the 
same  values  as  the  parameters  of  the  original 
system  at  this  time  (such  that  the  inner  loop  was 
stabilized).  It  can  be  seen  that  the  responses  of 
the  averaged  system  follow  closely  those  of  the 
original  system  but  fail  to  reproduce  the  effect 
of  the  bursts.  Although  the  averaged  system  can 
be  defined  by  continuation  across  the  stability 
boundary,  we  see  that  the  assumption  of  a  stable 
inner  loop  is  necessary  for  the  application  of 
averaging. 

Figure  20  shows  the  response  of  Cq  and  in 
the  case  of  the  second  example  and  with  g  =  0. 1 
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(the  initial  values  for  the  averaged  system  arc 
the  same  as  those  of  the  adaptive  parameters 
after  50 sec).  For  this  value  of  the  adaptation 
gain,  the  original  system  docs  not  exhibit  a  burst 
and  ’  we  see  the  close  approximation  by  the 
averaged  system.  Both  systems  converge  to  an 
equilibrium  point.  Forg  =  1,  the  approximation, 
as  shown  in  Fig.  21,  is  not  as  good  and  fails  to 
predict  the  burst  (here  the  initial  conditions  were 
taken  after  40  sec).  The  limiting  theory  of 
averaging  assumes  that  parameters  vary  infinitely 
slowly,  and  therefore  eliminates  the  internal 
excitation  phenomena  described  in  Section  4. 
While  the  theory  only  guarantees  that  the 
approximation  is  valid  for  g  sufficiently  small,  we 
found  plenty  of  situations  where  the  predictions 
were  accurate,  even  for  large  g  (see  Sastry  and 
Bodson  (1989)).  This  example,  however,  is  a 
case  where  a  significant  difference  occurs  when 
the  gain  is  not  sufficiently  small.  A  rule  of  thumb 
is  that  adaptive  parameters  must  vary  slowly 
compared  to  the  reference  input.  If  this  is  not 
the  case,  we  see  that  large  deviations  can  be 
caused  by  the  interactions  between  the  inner  and 
outer  loops. 

6.  CONCLUSIONS 

In  this  paper,  we  discussed  the  existence  of  a 
phenomenon  in  adaptive  systems  that  was  close 


to  the  well-known  burst  phenomenon.  It  was 
also  characterized  by  large  transients  of  the  error 
signal,  separated  by  quiet  periods.  The  main 
difference  was  that  the  phenomenon  occurred 
in  ideal  adaptive  systems,  and  was  not  caused 
by  noise  or  unmodeled  dynamics  (or  o- 

modification).  Therefore,  we  called  this  a 
pseudo-burst  phenomenon.  By  virtue  of  the 
theoretical  properties  of  adaptive  systems,  ttie 
bursts  could  not  recur  indefinitely. 

The  explanation  of  the  bursts  in  the  first 
example  was  shown  to  be  very  similar  to  the 
explanation  for  the  burst  phenomenon.  When 
the  adaptive  parameters  reached  values  which 
made  the  inner  loop  unstable,  large  transients 
followed  and  parameters  eventually  returned  to 
the  stable  region.  In  this  case,  however,  the 
crossing  of  the  stability  boundary  was  not  caused 
by  parameter  drift  due  to  noise,  but  occurred  in 
the  normal  evolution  of  the  parameters. 

Another  example  was  described  which  also 
bore  resemblance  to  the  burst  phenomenon.  Its 
origin  was  quite  different  however.  In  this  case, 
the  parameters  converged  to  v'alues  such  that  the 
output  error  was  nearly  zero.  These  parameters 
were  also  such  that  the  inner  loop  was  stable, 
and  such  that  the  outer  loop  itself  was  stable  for 
slow  adaptation.  For  a  range  of  adaptation  gains 
however,  fast  instabilities  arose,  so  that  the 
parameters  moved  away  from  the  equilibrium 
and  a  burst-like  response  of  the  output  error 
followed.  Eventually,  the  adaptive  parameters 
converged  to  other  values  which  stabilized  the 
system.  The  phenomenon  was  explained  by  a 
peculiar  interaction  betweeen  adaptation  and 
control. 

The  second  example  may  be  contrasted  with 
the  example  of  Shimkin  and  Feuer  (1988). 
There,  an  adaptive  system  was  considered 
where,  in  one  case,  the  controller  parameters 
were  updated  continuously  and,  in  the  other,  the 
controller  parameters  were  frozen  over  (long) 
periods  of  time  and  updated  periodically.  An 
effect  of  this  procedure,  called  block  processing, 
was  to  eliminate  interactions  between  adaptation 
and  control  such  as  described  above.  In  the  case 
of  block  processing,  the  adaptive  system  was 
found  to  be  stable.  With  continuous  adaptation, 
the  system  was  unstable.  Although  the  mechan¬ 
ism  of  instability  of  the  paper  of  Shimkin  and 
Feuer  was  different  from  what  we  observed  in 
this  paper,  it  is  interesting  to  note  that  the 
interactions  between  adaptation  and  control  may 
sometimes  eliminate  the  excitation  available  at 
the  input  of  the  plant,  while  at  other  times 
introduce  additional  excitation. 

It  is  not  clear  whether  anything  can  be  done  to 
prevent  the  pseudo-burst  phenomenon  of  Ex- 
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ample  1.  Since  the  error  is  not  negligible 
between  the  bursts  in  the  first  example,  a  dead- 
zone  would  have  to  be  so  large  that  it  would 
eliminate  the  benefits  of  adaptation.  However, 
what  is  striking  about  both  examples  is  the 
incredibly  slow  convergence  of  the  algorithm: 
nearly  1000  sec  are  needed,  while  the  dynamics 
of  the  inner  loop  are  concentrated  around 
1  rad  sec“‘.  This  is  especially  surprising  given  the 
amount  of  excitation  observed  in  the  signals. 
The  slow  convergence  may  possibly  be  attributed 
to  the  poor  convergence  properties  of  the 
algorithm  used  for  adaptation.  We  did  not 
explore  alternate  algorithms  but  it  is  possible 
that  schemes  based  on  least-squares  algorithms 
would  have  better  convergence  properties  (cf. 
Sastry  and  Bodson  (1989)). 
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Abstract 

A  multivariable  model  reference  adaptive  control  algorithm  is  presented  for  the  case 
when  the  high-frequency  gain  matrix  is  unknown.  Only  an  upper  bound  on  the  norm  of 
the  matrix  needs  to  be  known  a  priori.  A  transformation  of  the  parameters,  with  a  sort 
of  hysteresis,  is  used  to  guarantee  that  a  controller  matrix,  which  is  normally  the  inverse 
of  the  estimate  of  the  high-frequency  gain  matrix,  remains  nonsingular.  It  is  shown 
that  all  the  signals  in  the  adaptive  system  are  bounded  and  that  the  tracking  error 
and  the  regressor  error  converge  to  zero  for  all  bounded  reference  inputs.  Furthermore, 
exponential  convergence  is  achieved  when  the  regressor  vector  is  persistently  exciting. 
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Introduction 

Single-input  single-output  model  reference  adaptive  control  (MRAC)  results  have  been  ex 

tended  to  continuous-time  multivariable  systems  by  several  authors,  e.y.,  Elliot  k  Wolovich 

(1982),  Singh  fe  Narendra  (1984),  and  Tao  k  loannou  (1988).  Unfortunatdy,  current  MIMO 
algorithms  require  significant  a  priori  knowledge  or  constraints  on  the  high-frequency  gam 
matrix.  Either  the  high-frequency  gain  matrix,  Kp,  must  be  known  (fully  or  partially,  e.^., 
Kp  diagonal  with  the  signs  of  the  diagonal  elements  known),  or  it  must  satisfy  some  positive 
definiteness  condition  (e.y.,  there  exists  a  known  matrix  S  such  that  SKp  >0). 

In  the  SISO  case,  the  problem  of  relaidng  the  requirement  of  knowledge  of  the  sign  of  the 
high-frequency  gain  was  first  solved  by  using  controllers  based  on  the  so-called  Nussbaum 
gain,  c/.  Mudgett  k  Morse  (1985).  Because  of  the  limited  practical  use  of  these  controllers, 
Lozano  et  al  (1990)  proposed  a  completely  different  approach.  In  their  algorithm,  the  con¬ 
troller  parameters  are  obtained  from  the  estimated  parameters  by  applying  a  transformation 
with  a  sort  of  hysteresis. 

In  the  MIMO  case,  the  problem  of  unknown  high-frequency  gam  matrix  is  even  more  dif¬ 
ficult.  The  Nussbaum  gain  approach  is  not  directly  apphcable  to  MIMO  MRAC  algorithms. 
However,  using  the  hysteresis  idea  of  Lozano  et  al  (1990)  with  some  significant  modifications, 
we  show  in  this  paper  how  to  design  a  MIMO  MRAC  algorithm  so  that  stability  is  guaran¬ 
teed  even  if  the  high-frequency  gain  matrix  is  unknown  (only  an  upper  bound  on  the  norm  of 
the  high-frequency  gain  matrix  and  an  upper  bound  on  the  norm  of  the  matrix  of  unknown 
parameters  are  needed).  We  show  that  all  the  signals  in  the  system  remain  bounded,  that 
the  output  error  converges  to  zero,  and  that  the  regressor  error  is  in  L2  and  converges  to 
zero,  independently  of  the  richness  of  the  signals  used  as  reference  inputs.  We  also  prove 
that  exponential  convergence  is  achieved  when  persistency  of  excitation  conditions  are  met. 
Aside  from  the  nontrivial  multivariable  extensions,  an  original  contribution  of  our  paper  is 
also  the  exponential  convergence  result  (only  asymptotic  stability  is  achieved  by  Lozano  et 

al  (1990)  algorithm).  i. 

Finally,  our  success  in  applying  the  hysteresis  transformation  also  suggests  that  such 

transformation  may  prove  helpful  to  solve  other  problems  where  singularity  regions  must 
be  avoided,  such  as  in  adaptive  pole  placement,  and  in  nonlinear  control  using  linearization 

techniques. 


1  Definitions  and  facts 

Definition  1  ;  Hermite  normal  form 

Morse  (1976)  showed  that  for  any  square,  strictly  proper,  and  nonsingular  plant  P{s)  G 
^p^p^s)  (the  set  of  (p  x  p)  matrices  whose  elements  are  rational  functionsof  s),  there  exists 
a  unique  matrix  H{s)  €  called  the  Hermite  normal  form,  such  that: 

P{s)  =  H{s)U{s) 

U{s)  G  5Rp’''’(5)  and  Jirn  U{s)  =  Kp  nonsingular 


1 


0 


H{s)  = 


i 


0 

1 

(s+a)-2 


_ 1 _ 

(^+a)’^P  J 


where  dh{j{s)  <  r,-  —  1  and  a  is  arbitrary,  but  fixed  a  priori. 


Definition  2  :  Projections 

Following  Stewart  &  Sun  (1990),  Chapter  I,  let  AT  be  a  subspace  of  3?”  of  dimension  r  <  n 
and  let  the  columns  of  the  orthogonal  matrix  Qx  €  form  an  orthonormal  basis  for  X, 

with  QxQx  =  I-  The  matrix 

Px  =  QxQl 

defines  the  orthogonal  projection  onto  X,  i.e.  V  x  €  3J”,  Pxx  €  X  and  (x  —  P^x)  .L  X.  It 
can  be  easily  verified  that  the  matrix  Px  €  3?”^"  is  symmetric,  idempotent,  and  independent 
of  the  choice  of  Qx-  The  matrix 

Px^I-Px 

defines  the  orthogonal  projection  onto  X-^,  the  subspace  of  3^'^  of  dimension  n  —  r^  that  is 
the  orthogonal  complement  of  X.  The  matrix  P^  G  is  also  symmetric,  idempotent, 

and  independent  of  the  specific  choice  of  Qx- 

Let  7i{A)  be  the  range  of  A,  i.e.  the  subspace  of  3?"  spanned  by  the  columns  of  the 
matrix  A.  Then,  there  always  exists  a  square  orthogonal  matrix  U  such  that  AU  =  [  P  0  ] 
with  B  full  rank  and  one  has  that 


<3k(j)  =  PniA)  = 

Definition  3  ;  Singular  value  decomposition 

Let  A  €  3?™^”  have  rank  r,  then  the  singular  value  decomposition  of  A  is  given  by  (c/. 
Stewart  &;  Sun  (1990),  Chapter  I) 

U'^AV  -  S  U^U  =  /  =  V^V 

where  the  matrices  U  €  V  G  and  S  =  diag{cr,}  with 

0*1  ^  0*2  ^  •  P  O'r  ^  O'r-i-l  -  -  .  0 

Notation  : 

Assuming  than  D{s)  G  (the  set  of  {p  x  p)  polynomial  matrices),  dr{D  denotes  the 

maximum  polynomial  degree  in  the  i-th  row  of  D{s),  dciD  the  maximum  polynomial  degree 
in  the  i-th  column  of  D{s),  and  dD  the  maximum  polynomial  degree  in  D{s).  In  the  paper, 
P{s)  denotes  the  transfer  function  of  a  linear  time  invariant  operator.  While  P[u]  denotes 
the  output  of  the  operator  (in  the  time  domain)  with  input  u{t).  Finally,  if  x  is  a  vector, 
possibly  function  of  time,  we  denote  lx(f)|,  the  Euclidean  norm  of  x  at  time  t. 
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2  Model  reference  adaptive  control 


2.1  Fixed  controller 

We  use  a  standard  controller  structure  (see,  e.,.,  Elliot  &  Wolovich,  (1982)).  Assume  that 
the  plant  is  described  by  a  square,  nonsingular,  strictly  proper,  ^d  nummum  phase  transfer 
function  matrix  P(s)  €  assume  that  the  Hermite  normal  form  of  P{s)  (see  Defimtion 

1)  is  known,  assume  that  an  upper  bound,  y,  on  the  masdmum  of  the  observability  indices. 
tL,  know.  The  equation  of  the  plant  is  y,  =  P[u]  and  the  following  controUer  structure 

is  considered 


r  =  Afo(ro] 

u  -  Cor  +  +  A“^J5[yp] 


(1) 


where  ro  €  3^"  is  a  reference  signal,  Co  C  is  nonsingular,  A(s),  C(s)  €  [s],C  s  6 

SRpxrls).  and  M„(s)  €  *'«'(»)■  is  a  proper  stable  transfer  function  matrix  and  A(s)  is 

a  diagonal  matrix  of  Hurwits  polynomials.  A(s)  _  diag{A(sp  wi  (s)  "  '  _ 

By  combining  {Wn.Bn}.  a  right  matrix  fraction  description  (MFD)  of  P(s)  (F(s)  - 
Nr(s)D^(s)),  with  the  equation  of  the  controller  (1),  the  output  y,  can  be  expres 


Vp 


=  NR{{ly-C)DR  -  DNr)  ^ACo[r] 


(2) 


Now,  let  the  reference  model  M(s)  =  H{s)Mo(s),  where  H{s)  is  the  Hermite  normal  form 
P(s)  The  model  output  =  HAd„(r„l  =  H\r\.  It  can  be  shown  (see,  e.y.  Sastry  &  Bodson 

(1989))  that  3C„-  e  C-(a).  6 


JVr((A  -  C-)Dr  -  D-lVnl-’AC;  =  H 


(3) 


so  that  model  matching  is  achieved,  with  A-'D  proper,  and  K-'C  strictly  proper  In  par- 
lir  «  =  /?-■  noiingular.  BD'  <  w  -  1,  and  Br^C-  <  5A,.  It  can  abo  be  shown 
c7  S  Mathelin  &  Bodson  (1992)),  that  the  matrices  C-„C%D-  are  unijiie  if  we  assume 
knowledge  of  the  observability  indices  {n,)and  add  the  following  constraint 

BcD'  <  K  -  1  v;  ('•) 


2.2  Adaptive  controller 

In  order  to  implement  adaptation  in  the  model  reference  design,  we  must  estimate  Co*,  C*,  D\ 
which  satisfy  the  matching  equahty.  The  matching  equality  (3)  is  equivalent  to 


I  =  Co*i?"^/"  + A~^C*  + A  ^D‘P 


(5) 


Define  L{s)  =  diag{/(s)},  with  l{s)  Hurwitz,  dl{s)  >  d  where 
elements  of  H-\s).  Then  multiplying  both  sides  of  (5)  by  L 
function  matrices  to  u  leads  to 


d  is  the  maximum  degree  of  all 
and  applying  both  transfer 


L-\u]  =  C;{HL)-'^[yp]  +  L-\A  'C*[u]  +  A  'C‘[yp]) 


(6) 
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Since  H{s)  is  known  a  priori,  (6)  is  an  equation  where  the  unknown  parameters  appear 
linearly  which  can  be  rewritten  as 

L-^[u]  =  G;iHL)-^[yp]  +  (7) 

where  tj)  is  the  regressor  vector  and  6*  is  the  matrix  of  unknown  controller  parameters  whose 

p  first  columns  are  equal  to  Cj.  The  following  error  equation  can  be  derived 

62  =  6^^  -  L-\v\  =  {6^  -  (8) 

where  6  is  the  estimate  of  6*  and  <f)  is  the  parameter  error.  At  this  point,  6  can  be  computed 
using  a  standard  estimation  algorithm,  such  as  the  least-squares  described  in  the  next  section. 


2.3  Parameter  identification  algorithm 

We  will  assume  that  the  following  normalized  least-squares  identification  algorithm  with 
covariance  resetting  is  used: 


=  0  = 
P  =  -g 


1  + 
Pxl^iP'P 


with  5^,7  >  0 


with  -P(O)  =  P{jk)  =  >  0 


1  + 

where  62  is  the  identifier  error  (8)  and  {n}  are  the  resetting  time  instants  such  that 

{n}  =  {rJt|A^^^(P(r^■■))  =  h  with  0  <  h  <  ko] 


(9) 


where  denotes  the  largest  eigenvalue.  The  matrix  P  is  discontinuous  at  the  resetting 
instants  r^,  with  the  limit  on  the  right  Pfrjt)  equal  to  a  predetermined  matrix  koT.  If  ki  =  0, 
the  standard  least-squares  algorithm  without  resetting  is  obtained.  For  ki  >  0,  the  algorithm 
wiU  reset  whenever  the  P  matrix  becomes  too  small. 


Definition  4  :  Persistent  Excitation 
A  bounded  vector  ^  is  persistently  exciting  (PE)  iff 


3a>0  3^>0  such  that 


> 


al 


Vto  >  0 


Definition  5  ;  Sufficient  Excitation 
A  bounded  vector  ^  is  sufficiently  exciting  (SE)  iff 

3  a  >  0  such  that  V  <0  >  0  3  <^(fo)  >  0  such  that 


dr  >  al 


Note  that  if  a  signal  is  PE  then  it  is  also  SE,  but  the  converse  is  not  true. 
Lemma  1  ;  Properties  of  the  estimation  algorithm 

Assuming  that  tp  G  Looe,  the  estimation  algorithm  (9)  has  the  following  properties 
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1.  0  <  P  <  kol,  e  Looo  and  (1^::;$^’  ^  i+lMl-  ^ 

2  _  rjt)  is  bounded  below  V  k. 

3.  4>,  4,P=  i+tl4l~  ^ 

4-  €  ^2- 

5  If - it'-rrm  is  not  SE  or  if  A:i  =  0  (no  resettings): 

•  {rjt}  is  a  finite  set. 

•  P  £  Li- 

•  P(t)  and  converge  to  some  Poo  and  <;&oo- 

6-  If 

•  limt_oo  <f>{i)  =  0- 

7.  If  '=  PE  and  k,  >  0: 

•  limj_oo  </’(0  =  0  exponentially. 

The  proof  is  in  de  Mathelin  (1993). 


The  specific  adaptation  algorithm  is  chosen  for  its  special  properties  given  m  Lemma  1 
Most  important  is  the  fact  that  the  parameter  error  cf>  converges  (although  not  ^^cessari  y 
to  zero)  independent  of  richness  properties  of  xp.  It  may  converge  to  zero  or  to  some  other 
value.  Also,  if  ^  is  not  SE  or  if  fci  =  0  (no  resettings)  P^  wiU  have  the  same  ^ 

e.  and  there  will  be  a  finite  number  of  resettings.  These  properties 

A  ( P(f))  for  the  resetting  instead  of  A„,„(P(0)  a^  Sastry  &  Bodson  (1989).  When  there 
is  not  suLent  excitation,  the  algorithm  will  not  reset  the  covarian<^  after  some  time  and 
the  algorithm  has  the  same  properties  as  when  covariance  resetting  is  not  used  Finally 
should  be  noted  that  the  exponential  convergence  under  PE  conditions  is 

than  asvmptotic  convergence)  when  the  covariance  resetting  is  used.  However,  the  stability 
results  3^be  valid  whether  the  resetting  is  used  or  not  and  whether  there  is  excitation  or 

not. 


3  Hysteresis  transformation 


AlthouRh  it  might  not  be  obvious  «  priori,  the  development  of  a  stability  proof  for  the  MRAC 
Wm  reveils  that  the  parameter  C„  in  the  control  law  (1)  must  be  -ns^u  a-  Mo 
nreciselv  the  matrix  Co"'  must  remain  bounded.  Since  this  property  is  not  guaranteed  y 

algorithm  (9),  the  parameter  C„  used  in  «ae  oon troUaw 
be  dissociated  from  the  estimate  C„.  Hereafter,  we  w.U  use  the  subscr.pt,  c,  to  mark 
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controller  parameters,  an  opposed  to  the  estimated  parameters.  So,  instead  of  being 
equal  to  the  estimate  Co,  a  transformation  is  applied  to  make  sure  that  Co^  tas  a  bounded 
inverse  even  if  Co  is  close  to  singularity. 


3.1  Necessary  properties 

To  complete  the  proof  of  stability  and  convergence  of  the  MRAC  algorithm,  one  finds  that 

the  following  properties  are  required  from  the  parameter  transformation; 

1.  C'oT  ^  I'oo- 

2.  6c  e 

3-  Pc  -  riSfc  ^  -^2  n  where  4>c  =  6c-  e\ 

4.  If  lim^^oo  =  0  then  lim^^oo  <f>cii)  =  0- 

5.  If  P{t)  and  converge  to  some  Poo  and  <^oo  then  converges  to  some  <f>c^. 

Since  the  transformation  that  we  propose  to  achieve  this  objective  is  complex,  we  proceed 
to  present  the  transformation  in  several  steps. 


3.2  Simple  hysteresis 

Define  Coj{t]  as  being  equal  to  Co(t)  when  <Tnun(Co)  is  sufficiently  large,  and  staying  constant 
when  o-„un(C'o)  becomes  too  small  (a^n  denotes  the  smallest  singular  value).  More  precisely, 
since  an  upper  bound  on  \Kj,\  is  known,  a  lower  bound,  tr,  on  cTminfCo)*  also  known.  Let 
o-mm(C'o(0))  >  cr,  and  define  Cq^  such  that  the  relationship  between  and 

cITbe  described  by  Fig.  1.  Define  {h}  as  the  time  instants  when  the  value  of  Co,  becomes 
frozen  and  {Pt}  as  the  time  instants  when  a  jump  in  the  value  of  Co,  occurs  (when  Co,  is 
unfrozen  and  becomes  equal  to  Cq  again).  The  levels  at  which  freezing  and  unfreezing  occur 
are  chosen  to  be  different  to  prevent  repeated  switchings  arbitrarily  closely.  Precisely  stated, 
we  let  crroin(C'o(0))  >  cr,  To  =  0,  and  V  A:  >  1 


tk  is  such  that 


Tk  is  such  that 


(^nia{Co{t))  >  0-/&  V  Tk-1  <t  <tk 

cr^niCoiT,))  =  a/a 
amiii(C'o(0)  <  a/a  V  h  <  t  <  Pjt 


(10) 


In  Fig.  1,  the  parameters  a  and  b  are  respectively  equal  to  2  and  4,  but  other  values 
can  be  selected,  provided  that  1  <  a  <  6  <  oo.  Note  that,  since  Co{t)  is  continuous, 
tk<Tk<  h+i  <  Tk+i  Vfc  >  1  and 


a„^„(C'o(t))  >  a/6  if 
a™„(C'o(0)  <  if  h<t<Tk 


The  relationship  between  Co,{t)  and  Co{t)  is  described  by 

Co{h)  if  h<t<T, 


(11) 
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so  that  a^^iCojit))  >  (r/b  V  t.  The  following  parameter  transformation 

Co.  =  Co,  0,^6  (12) 

would  then  appear  to  be  adequate  at  this  stage.  However,  it  can  be  checked  that  the 
transformation  (12)  does  not  guarantee  property  3  of  section  3.1,  although  it  guarantees  all 
the  others.  Therefore,  the  following  additional  transformation  is  necessary. 


3.3  Simple  hysteresis  +  projection 


To  solve  the  problem  of  guaranteeing  property  3,  the  matrix  6c  in  the  controller  is  also 
transformed,  using  the  following  parameter  transformation 


Gc{t)  = 


'  m 

.  e{i)  +  -  cm) 


if  Tk-1  <t<tk  Vfc  >  0 
if  tk<t<Tk  Vfc  >  0 

(13) 


where  Pp  is- the  rectangular  matrix  made  of  the  first  p  columns  of  P .  Since  P  is  symmetric,  P^ 
is  the  matrix  made  of  the  first  p  rows  of  P.  It  can  be  checked  that,  with  this  transformation, 
Co.  =  Co,.  Further,  the  transformation  is  well-defined  and  all  the  necessary  properties  of 
section  3.1  are  guaranteed  as  long  as 


3e>0  s.t.  <r^n{Pp{t))>e  Vh<t<Tk 


(14) 


In  the  SISO  case,  it  can  be  proved  that  condition  (14)  is  always  satisfied,  so  that  the  parame¬ 
ter  transformation  (13)  is  always  well-defined  (it  is  the  transformation  presented  in  Lozano  et 
al  (1990)).  Indeed,  given  the  least-squares  estimation  algorithm  (9),  ^(F~^^)  =  0  between 
resettings,  from  which  one  can  deduce  that 


cm=c;'^+pj{t) 


ko 


(15) 
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where  tj  is  the  last  covariance  resetting  time  instant,  i.e.  rj  <  t  <  rj+i-  Therefore, 

<r™(C’-(())><r-<.^{P7(<))^  (16) 

In  the  SISO  case,  Pp  is  a  vector.  Based  on  (16),  if  for  some  t  >  0,  1-Pp(i)l  ~~ 

then  the  scalar  C'o(t)  is  such  that  lC'o(t)|  >  o-j a  and  3  k  >  1  such  that  Tk-i  ^  t  <  tk. 

Therefore,  condition  (14)  is  always  satisfied. 

Unfortunately,  in  the  MIMO  case,  there  is  no  guarantee  that  condition  (14)  is  respected 
V  t  >  0.  Indeed,  if  there  is  excitation  in  some  directions  but  not  others,  o'inin(-fj)(t))  could 
tend  to  zero  while  a-,n^x(Pp(<))  would  not.  Therefore,  one  could  conceive  that,  given  cer¬ 
tain  initial  conditions  and  given  certain  reference  signals,  limi_oo  frnan(-Pp(t))  =  0  with 
llm£_.oo  <rmin(C'o(t))  <  cr/^-  III  that  case,  the  transformation  (13)  would  not  be  well-defined 
Vt>0.  Therefore,  property  2  of  section  3.1  would  not  be  guaranteed  for  this  transformation 
in  the  MIMO  case. 


3.4  Selective  hysteresis  -h  projection 


To  solve  the  problem  of  the  pseudo-inverse  Pp{Pp  Pp^^  not  being  necessarily  bounded  in 
the  MIMO  case,  an  additional  modification  is  brought  to  the  parameter  transformation. 
Essentially,  the  problem  is  that  if  the  estimated  matrix  Cq  persists  in  being  singular,  despite 
the  presence  of  excitation  in  certain  (but  obviously  not  all)  channels,  then  the  matrix  Cq^ 
must  be  updated  selectively  in  the  directions  where  there  is  excitation.  The  major  challenge 
is  to  achieve  this  objective  while  maintaining  the  boundedness  of  Cq^  and  the  properties 
listed  in  section  3.1. 

We  consider  the  following  transformation 


0c{t) 


■  eit) 

e{t)  -h  p{t)Vp{t)^p\t)irf{t){ciit)Rj.{t)  -  cut)) 


where  VpT^p^Up  is  a  lower  rank  approximation  of  Pp{l^Pp) 
following  partioned  singular  value  decomposition  of  P^ 


if  Tk-i  <t<tk  Vk>0 
if  tk  <  t  <  Tk  Wk>  0 

(17) 

Specifically,  we  consider  the 


[Uf  U.f Pj[Vp  K]  = 
such  that 


T.P  0 

0  E, 


=  E  =  diag  {cr,}  cr,-  >  aj  i  <  j  (18) 


Ep  = 


CTi  0  .  0 

0  <T2  . 

0  .  0  crpfp  J 


>  +  Sp)I  >  {€p  +  Sp)I 


(^Np+l  0  .  0 

ep {p  —  Np  —  l)5p  0  .  0 

E.  = 

.  o-p-i  0 

0  .  0  CTp 

< 

.  .  cp  "h  Sp  0 

0  -  0  cp  . 

<  (cp  +  (p  —  Np  ~  l)^F)-f 


(19) 
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N,  is  the  size  of  Sp  and.  therefore.  Up  6  Vp  S  W.  €  Sr><(r-rrri,  and  K  6 

^2pux{v-Np) ^  <piie  constants  cp  and  Sp  are  arbitrarily  chosen,  but  must  satisfy  the  following 

conditions 


0  <  <  (1  -  1/a) 

0  <  Sp^epjocp 


crko 


(1  +  (p  —  l)/c(p)(cre.  +  l^(O)l) 


with  ocp  >  1 


(20) 


where  <r,.  is  a  known  upper  bound  of  |«-|.  BasicaUy,  2.  contains  the  singular  valu«  of 
no  greater  than  ep  and  the  ones  which  are  suffiaently  close  to  them  (by  increment  Sp). 
contains  the  larger  singular  values  such  that  (r^nC^p)  >  +  ^P-  The  difference  Sp  is 

found  to  be  required  to  guarantee  continuity  in  the  proof.  ^  n  TrT\ 

Pinallv  =  VfUl  where  Vt  is  an  orthonormal  basis  of  the  subspace  dT  ==  l<.[UojUpUp) 

of  size  Np  and  Uo  is  an  orthonormal  basis  of  the  subspace  =  Tl{CoU^Uj)^  also  of  size  Np. 
Therefore,  given  the  properties  of  projections  (c/.  Definition  2) 


V[Vf  =  I 
UjUo  =  I 


and 

and 


VjVj-  =  P: 


7l{CojUpU-f) 


UoUl  =  P^,a.v.vr,  ^  Py  G 


(21) 


If  then  Np=p.  and  Px  =  Py  =  P  « ‘r^AP^’W)  <  ‘p  fVp  <  p  and 

Px  and  Py  are  given  by 


Px  -  Co,Up{UfClCojUp)-^U^Cl 
Py  =  {I-CoU,{UjCjCoUp-^UjC^) 


(22) 


Indeed,  referring  to  Definition  2,  we  let  A  =  CojUpU^  and  t/  =  [  Up  U,],  so  that  B  - 
Co, Up  and  Px  is  given  by  (22).  This  assum^  that  B  =  Co,Up  ^s{n\\  colurm  ran 
i.e.  UlCiCo  Up  nonsingular.  However,  since  Co,  is  nonsingular,  (C/p Go,, Dp  j  is  well 
defined  and  the  number  of  columns  of  Vj  is  equal  to  Np.  A  similar  derivation  appHes  for 
Py.  From  (15),  it  can  be  checked  that 


uJeUi)  =  ujcf  +  ujpj{t)^  =  ujc;  +  E.K- 


=  ujcf  + 

ko  ^0 


(23) 


where  Tj  is  the  last  covariance  resetting  time  instant,  i.e.  tj  <  t  <  t,-+i.  Therefore,  given 
(19)  and  (20) 

>  u  -  (ep  +  (p  -  Wp  -  >  <’I<^  (2<) 

SO  that  {UjC^CoU^Y^  is  weU-defined  and  the  number  of  columns  of  Uo  is  indeed  equal  to 

Np. 

Comments:  j  rr  in  r 

To  guarantee  some  continuity  properties  and  the  uniqueness  of  Vf  and  Do,  the  ^rarn- 

Schmidt  orthogonalization  procedure  with  memory  described  in  the  appendix  is  applied  to 

Px  and  Py  to  compute  Vj  and  Uo  respectively.  Using  this  procedure,  we  also  have  that  it 

>  ^P  then  iVp  =  p,  ^^(0  ==  /,  VpmYitWit)  =  P.(C)(P/'(0i^r(0)-^ 

the  parameter  transformation  (17)  is  simply  identical  to  the  parameter  transformation  (13). 
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It  can  also  be  shown  that  Np  >  1-  Indeed,  if  3  t  >  0  such  that  Np  =  0,  then  from 
(24),  a^(Co)  >  cr/a.  Therefore,  3  k>0  such  that  T^-i  <  t  <  4  and  0,(0  -  ^(f)- 
other  words,  if  Pp  was  to  be  small  in  all  directions,  then  it  would  mean  that  there  had  been 
enough  excitation  so  that  Co  would  be  close  to  and  one  would  be  out  of  the  singularity 
region.  Therefore,  Np  >  1,  ^-^(Sp)  >  cp  +  Sp  and  the  parameter  transformation  (17)  is 
weU-defined.  BasicaUy,  the  inverse  (P^^Pp)"^  has  been  replaced  by  a  lower  order  inverse, 

whose  existence  and  boundedness  can  be  guaranteed. 

Finally,  it  can  be  verified  that  Cq^  G  Poo-  Indeed,  if  Tk-i  ^  t  <  tk  ov  i(  tk  <  t  <  Tk  and 
>  £p,  then  Cj^(0  =  Cjfi)  whose  inverse  is  always  bounded.  If  4  <  t  <  T*  and 

cTrr^iPj)  <  ep,  then 

=  cUt)  +  P^it)Vpit)Y;^KiWUi)iol{i)RU^^ 

=  c^(t)  +  Up(t)uUt)icl{t)RUi)  -  OUi)) 

=  Up{t)Ul{t)Cl{t)Rl{t)  +  U.[t)Uj{t)C'^{t)  (25) 


Therefore,  using  (21)  and  (22), 

ClCo.  =  UpUlClVjVjCo.UpUl  +  U.UjClCoU.U] 
=  [Up  P.  ] 


U^ClCo.Up  0 

0  UjC^CoU, 


UT 


(26) 


so  that  <7mIn(PoJ  > 

Finally,  given  (17)  and  all  the  previous  definitions,  the  parameter  transformation  with 
hysteresis  will  have  the  following  form 

0,(t)  =  0(f)  +  P(OQ(O  (2"^) 


with 


'  0 

Vp{t)E-^\t)Ufit)(Cl{t)Rpii)  -  cm) 


if  Tk-i  <t<tk  Vk>0 
j  tk<t<Tk  Vfe  >  0 
^  1  and  cr^n{Pj{i))  >  ^P 

■  r  \  tk<t<Tk  V^:  >  0 
^  I  and  cr^n{Pj{t))  <  ^P 


(28) 


Lemma  2  ;  Properties  of  the  hysteresis  transformation 

Assuming  that  ^  G  Looc  and  that  the  parameter  estimation  algorithm  is  defined  by  (9),  then 
1.  The  transformation  is  always  uniquely  defined. 


2.  Cq^  G  Loo- 

3.  Q  G  Poo,  6a  G  Poo,  and  </>,  G  Poo,  where  <f>o  =  6o-  0\ 


4.  Ha 


i+HM 


It-  G  P2  n  Pc 


5.  If  lim<^oo  m  =  0  then  limi^oo  ^c(0  =  0- 
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6.  (fjt  -  Tjb_i)  and  (Tjt  -  h)  are  bounded  below  Wk. 

7.  {Tfc}  and  {ffc}  are  finite  sets. 

8.  If  P(()  and  ^(t)  converge  to  some  P„  and  then  Ut)  converges  to  some 
The  proof  is  in  de  Mathelin  (1993). 


0  e  is  obtained  because,  even  though  P-'it)  is  not  ne^^ly  boundrf 
as  t  increases  Ej'  exists  and  is  always  bounded  when  Q  5«  0.  In  other  words,  should  P  be 
rnbrunded.  either  E?'  wiU  continue  to  exist  or  the  algorithm  wiU  come  out  of  the  hysterests 
region.  Th;  fourth  property  is  most  important:  it  shows  that  the  mam  property  on  ^  m 
the  estimation  algorithm  remains  valid  when  the  climated  parameter  error 
bv  the  controller  parameter  error  dr-  The  fact  that  the  intervals  (tr  - 

Le  bounded  below  comes  from  the  normalized  nature  of  the  adaptation  algorithm  and  from 
the  separation  of  the  freezing  and  unfreezing  levels  a  and  b.  It  ehrmnates  the  possihih  y 
"rhavLg  an  infinite  number  of  jumps  in  a  finite  interval  of  time  The  seventh  proper  y 
tells  us  that  there  is  a  finite  number  of  passages  in  the  hysteresis  loop.  After  a  while,  the 
Jgmithm  will  settle  and  no  more  jumps  will  occur.  The  property  is  the  ^nsequence  of 
the  convergence  of  d  (not  necessarily  to  zero).  The  algorithm  could  actually  settle  inside 
the  hvsteriis  loop.  In  that  event,  we  will  see  in  the  following  section  that  the  stability 
properties  of  the  adaptive  algorithm  are  preserved.  The  last  property  is  obtained  becajise  o 
the  difference  6p  between  Ep  and  E<  and  because  of  the  use  of  the  particular  Gram-Schmidt 
orthogonalization  procedure  defined  m  appendix. 

Note  that  from  (27)  and  (28)  with  (25), 


umclit)  =  u^cimUt) 

ul{t)cl(t)  =  c/f(t)cJ(<)  =  cfJ(t)Co-’'(')  +  Sr(0v7(0^ 


where  n  is  the  last  covariance  resetting  time  instant.  Basically,  this  particular  parameter 
Tranmon  unfreezes  Co.  in  the  directions  where  (Cj  -  Co^)  is  sufficiently  small,  using 
the  knowledge  that  there  has  been  sufficient  excitation  in  those  directions. 


4  Stability 

If  we  define  the  model  signals,  V’m,  as  the  signals  ^  when  <j>  =  0,  then  we  can  define  the 
regressor  error,  e-^,  as  =  t/’  —  V’m- 


Theorem  1  :  Stability  of  the  MRAC  system 
Consider  the  MIMO  MRAC  system  with  the  parameter  estimation 
teresis  transformation  described  previously.  If  the  reference  input  r 
continuous,  then 


algorithm  and  the  hys- 
e  Loo  and  is  piecewise 


•  All  states  of  the  adaptive  system  are  bounded  functions  of  time 


•  The  output  error  cq  —  y-p  —  Vm  ^  ^oo  and  hmt_oo  Co  0. 
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•  The  regressor  error  —  ip  —  ipm  €  Too  H  and  limi_oo  =  0. 

The  proof  is  an  extension  of  the  SISO  proof  of  Sastry  &  Bodson  (1989)  and  can  be  found  in 
de  Mathelin  (1993). 

5  Example 

Let  the  plant  be  a  (2  x  2)  system  of  order  4,  with  the  following  transfer  matrix 

3^ +3^2+31+2  3«+4i^+6s2+Ss+2 

P{s)^ 

1  3^ +35^ +33+1 

33+33^+33+2  3< +4+* +63^+53+2 

Let  A(s)  =  diag{(s  +  A)},  L{s)  =  diag{(f  +  1)},  and  M{s)  =  H{s)  =  diag{^}.  Then, 
1  2 

c*  =  /<"-!  =  ^  ^  and  <Tn^n(C'o)  =  0.4142.  We  present  simulations  of  the  adaptive 

system  with  and  without  the  hysteresis  transformation  to  show  how  important  the  parameter 
transformation  algorithm  is  for  convergence  and  stability.  In  our  simulations,  the  different 
parameters  are  set  to  the  following  values:  A  =  4,  /  =  10,  5-  =  10,  7  =  0,  Iro  =  1,  ki  —  0, 
Jc2  =  0,  cr  =  0.1,  a  =  2,  6  =  4,  Cp  =  10“'’,  cep  =  2,  and  the  initial  estimate  of  the 

parameters  ^?(0)=  ^  -0  5000000'  f^min(C'o(0))  =  0.5  >  tr. 

The  reference  inputs  rj  =  sin(5t)  +  sin(7t)  +  sin(100  and  r2  =  sin(6t)  +  sin(80  +  sin(9t) 
contain  6  frequency  components  each  and,  consequently,  are  sufficiently  rich  to  guarantee 
persistency  of  excitation  (c/.  de  Mathelin  &  Bodson  (1992)). 

In  the  first  simulation,  the  adaptive  control  algorithm  has  been  implemented  with  the 
hysteresis  transformation.  The  evolution  of  the  norm  of  the  output  error,  eo  =  yp  —  j/m,  is 
shown  in  Fig.  2.  After  a  transient  of  about  20  seconds,  the  output  error  rapidly  converges 
to  zero.  The  singular  values  cr,n;n(CoJ  and  <rmin(C'o)  are  compared  in  Fig.  3  and  Fig.  4 
for,  respectively,  the  first  5  and  20  seconds  of  the  simulation.  One  sees  that  the  system 
goes  three  times  into  the  hysteresis  before  finally  settUng  outside  the  hysteresis  region.  The 
time  instants  (in  seconds)  for  entering  and  leaving  the  hysteresis  are  ti  =  0.165,  Ti  —  0.208, 
t2  =  0.509,  T2  =  1.762,  U  =  2.224,  and  -  16.319.  It  can  also  be  seen  that  aniin(C'o)  goes 
to  zero  6  times  during  the  simulation  before  converging  toward  cr,nin(C'o)  =  0.4142  (see  Fig. 
5).  During  the  simulation,  cr^{Pj)  stayed  above  ep  inside  the  hysteresis  region. 

Finally,  the  same  simulation  was  executed  without  the  parameter  transformation  algo¬ 
rithm.  The  norm  of  the  output  error  is  shown  in  Fig.  6  and  crn,in(C'o)  in  Fig.  7.  Clearly, 
after  a  first  quick  passage  through  the  zero  region,  cr„un(C'o)  converges  into  the  zero  region 
and  does  not  leave.  The  control  parameter  matrix  Co^  =  Cq  settles  in  a  singularity  region 
and,  consequently,  suppresses  part  of  the  reference  input  excitation.  Model  matching  is  now 
impossible  and,  as  Fig.  6  clearly  shows,  the  output  error  does  not  converge  to  zero. 
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Appendix 

Gram-Schmidt  orthogonalization  with  memory 
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Let  W{t)  E  be  a  matrix  of  rank  r  function  of  time  and  let  ^  >  0  be  a  constant  to  be 
fixed  later. 

1.  At  time  f  =  0,  apply  the  following  procedure: 

Wj^i)  =  ^f^Hh  column  of  s.t. 

(VKjt(i)l  =  maxjtlPkfcl  and  minimum 

(<.\  =  0) 

W^2)  =  column  of  s.t. 

=  maxjt  and  minimum 

(lyw  =  iyp>  =  0) 

column  of  s.t. 

=  maxjt  and  A:h)  minimum 

==  0 

Then,  the  matrix 

■^(0)  =  1  ife  •  •  •  ife  ] 

is  an  orthogonal  basis  of  7^(1T(0)),  the  space  of  dimension  r  spanned  by  the  columns  of 
W^(0).  The  order  of  selection  of  the  columns  of  1T(0),  *  =  1,  •  •  •  >  r  is  uniquely 

defined  by  this  procedure.  Therefore,  the  matrix  Ar(0)  is  also  uniquely  defined. 

2.  Keep  the  initial  order  of  selection  of  the  columns,  for  the  orthogonalization 

of  W{t),  t  >  0,  until 

3  t  =  ti(]V)  >0  31  <j'<r  3l<l<p  s.t.  +  A  < 

where  =  W.  Then,  at  f  =  fi(lT)  the  procedure  defined  for  f  =  0  is  apphed  for 
the  computation  of  X(ti),  and  a  new  order  of  selection  of  the  columns, 
is  found. 

3.  Finally,  continue  this  procedure  for  the  orthogonalization  of  kF(f),  t  >  ti(W).  The 
set  {tit(kF)}  are  the  time  instants  when  the  order  of  selection  of  the  columns  of  W  is 
changed.  The  matrix  A(t)  will  be  uniquely  defined  for  all  t  >  0. 

The  procedure  is  applied  with  W  =  Px,  leading  to  X  =  V},  and  similarly  to  Py,  leading  to 
Uq.  The  advantage  of  using  this  Gram-Schmidt  orthogonalization  with  memory  is  that  the 
matrices  Vj  and  Uq  are  uniquely  defined  for  given  matrices  Px  and  Py.  The  constant  h  is 
chosen  sufficiently  small  that  no  vector  K  may  become  equal  to  zero,  while  preventing  the 
order  of  selection  of  the  columns  to  change  an  infinite  number  of  time  during  a  finite  time 


Kx 

f7(i) 

¥2 

iy(2) 


=  (/ 


Wf.{i)  where 


)W 


Kl 


-  {I- 


where 

Y?Y2 


Yr  —  where 

l^(ri  . 


Y 


yjy 
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interval.  Note  tliat  using  the  Gram-Schmidt  orthogonalization  procedure  with  memory  for 
Px  =  I  or  Py  =  I  leads  to  Vf  —  I  and  Uq  =  I.  Finally,  it  can  be  shown  (c/.  de  MatheHn 
(1993))  that  if  ^  then  the  Gram-Schmidt  orthogonalization  with  memory  is  always 

weU-defined  for  Vj  and  Uo-  In  other  words,  lh;(01  >  0  V  <  >  0,  V  z. 


